Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5217 |
Symbol | |
ID | 5673551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6262963 |
End bp | 6264132 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641244071 |
Product | appr-1-p processing domain-containing protein |
Protein accession | YP_001509481 |
Protein GI | 158316973 |
COG category | [R] General function prediction only |
COG ID | [COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0611343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTTT CCGTGAACAC CAAGAAGTGC TTTGTCGTCA TGCCGTTCGG TGAGAAGGCG GGGCCAGACG GTTCGTTAAT CGATTTCGAC AACGTCTACC GGGACGTCAT CAGGGAGCCG GTCGAGTCGC TGGGCTTCAA GGTGGTGAGG GCCGACGAGA TCGAGCGGCC CGGCTCGATC CACAGCGACA TGTTCCGGCA CATCGCGATG GACGATCTGG CGATCGTGGA CATCACGACG GGAAACCCCA ACGTCCTCTA CGAGCTTGGC GTTCGACATG CCCTGAGGCC GTCATTGACG ATCATAATCA AGCGGCGTGG CACAAAAATT CCCTTCAATT TTGCGGGAGA GCGTGTCATC GACTATCCGA GCGTAAGGGG CAGCTACGCG GACAGTCGAG AGGAGATTCG AAGGTACATT GAGAACGGGC TGAAGAAAAG TGAGACAGAC AGTCCGATCT TCAATTTTCT GCAGGATGCC AGGAAGGATT GGAAGCGGGA GCGGATTACT TCGCGAGATG AATACCGCTA TCGGACCGTG AGCTCGCCGA AGAAGAAGAT CAGTGTGATC ACCGGTGACA TTCGTGACTG GCGTGGTATC GATGTCTGGG TGAACTCGGA GAACACCAAC ATGCAGATGG CGCGGTTCTT CGACCGTTCG CTGTCTGCGA TGATCCGATA TGAGGGCGCG GTCAAGGACG CGAGCGATGA AGTTGTCGAG GACACGATCG CCGGCGAGCT GACCGCGCTC CTCGGAGGTC GGGAGACGGT GACCGCGGGT GCGGTGTACG TCACCGGCTC GGGTGCTCTC GCCGCAACCC GTGGCGTGAA GAAAATCTTC CACGCGGCGA GCACCCAGGG CGTTCCGGGG AGCGGATACC AGATGATTCA GAATGTCGAG AGATGTGTGA CCGCATCGAT GCGGCGTATC GACGAGCAGT TCGCTGACGC AGGACTGAGG AGCATCGTCT TCCCGATGAT GGGAACAGGG GAGGGTGGCG GCGACGTCTA CGCCACCGCA CCGCGTCTGA TACAGACGGC CGTTGCCTAT CTGGCCTTCA ACCCAGACAG TGTTGTCGAG AAGGTCTACT TCTCGGCTTG GAACCGCCGT GACCTCGAGG CCTGCCTGAA CGCCCTGACG GATGCGGTCG AGGTGGAGCC CATCGGCTGA
|
Protein sequence | MSLSVNTKKC FVVMPFGEKA GPDGSLIDFD NVYRDVIREP VESLGFKVVR ADEIERPGSI HSDMFRHIAM DDLAIVDITT GNPNVLYELG VRHALRPSLT IIIKRRGTKI PFNFAGERVI DYPSVRGSYA DSREEIRRYI ENGLKKSETD SPIFNFLQDA RKDWKRERIT SRDEYRYRTV SSPKKKISVI TGDIRDWRGI DVWVNSENTN MQMARFFDRS LSAMIRYEGA VKDASDEVVE DTIAGELTAL LGGRETVTAG AVYVTGSGAL AATRGVKKIF HAASTQGVPG SGYQMIQNVE RCVTASMRRI DEQFADAGLR SIVFPMMGTG EGGGDVYATA PRLIQTAVAY LAFNPDSVVE KVYFSAWNRR DLEACLNALT DAVEVEPIG
|
| |