Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0830 |
Symbol | |
ID | 5669246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 970323 |
End bp | 972332 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239759 |
Product | hypothetical protein |
Protein accession | YP_001505194 |
Protein GI | 158312686 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.56502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCAACA AGCTCGCGGA GCAGACGTCC CCCTACCTGC TGCAGCACGC CGACAACCCG GTCGACTGGT GGCCCTGGGG GCCCGAGGCG TTCGCCGAGG CCACCACCCG CGGGGTGCCG GTGTTGCTCT CCGTGGGCTA CGCGGCCTGC CACTGGTGCC ACGTCATGGC GCACGAGTCC TTCGAGGACC CCGAGATCGC TGCGTACATG AACCAGCACT TCGTCAACAT CAAGGTCGAC CGGGAGGAGC GCCCCGACGT CGACTCGGTG TACATGGACG TCACGGTCGC GCTCACCGGG CACGGCGGCT GGCCGATGAC GGTGTTCCTC ACCCCGGCCG CCGAGCCGTT CTTCGCCGGC ACCTACTTCC CACCCCGGCC GATGCGGGGC AGCGCGTCGT TCCCCCAGGT CATGGCGGCC ATCGTGGACG CGTGGACGGC GCGCCGGGCG GAGGTCGAGC AGTCCGGGGC GGACATCGCC CGCCAGCTGG CGGAGGCGGT GGCGCCCGGT GGAGCGGCGT CCGGCGGTGG GGCCACAACA CAGATCACGG CCGACCTTCT CGACCGGGCG GTCGCCGGGC TGGCCGACCG GTTCGACTCC GTGCACGGCG GGTTCGGCGG GGCGCCGAAG TTCCCACCGT CGATGGTCGC CGAGATGCTG CTGCGGAGCT GGGCCCGCAC CGGGGACGGC CGGGCCCTGG GAATGGTGCG CGAGACCTGC GAGCGGATGG CGCGCGGCGG GATGTACGAC CAGCTCGGCG GCGGCTTCGC CCGCTACAGC GTGGACGAGT CGTGGACCGT CCCGCACTTC GAGAAGATGC TGTACGACAA CGCCCAGCTG CTGCGGGTCT ACCTGCATCT GTGGCGGGCC ACGGGCCTGC CGCTGGCCGA GCGGGTGGTG CGCGAGACCG CGGCCTTCCT GCTCGCCGAC CTGCGTACCC CCGAGGGGGG CTTCGCCTCG GCGCTCGACG CCGACGCCGT GCCGGCGGGC AGCCCCGGCG GCCATCCCGA GGAGGGCGCG AGCTACTCGT GGACGCCCGC CCAGCTGGTG GACGTGCTCG GCCCCGACGA CGGTGCCCTG GCCGCGCGGG TCCTGGGGGT CACCGCGGAG GGGTCGTTCG AGCACGGGAC GTCCGTGCTG ATGCTGCCGG CCGACCCGGA GGACCCCGCC AGGTTCGCCC GGGTCCGGGC GGCGCTGGCC GCCGCGCGCG CCACCCGCCC GCAGCCGGCC CGGGACGACA AGATCGTCGC GGCCTGGAAC GGTCTGGTCA TCGGGGCGCT CGCCGAGGCG GGCGCGCTGC TGGGCGAGCC GTCCTGGGTG GGGGCCGCCG AGCGCGCCGC CGAGCTGCTG CGGGACGTCC ACCTGCACGA GGGCCGGCTG TGGCGGACCA GCCGGGACGG CCGGCGCGGC CCCAACGCCG GTGTGCTGGA GGACTACGGG TGTGTCGCCG AGGGCTTCCT GACGCTGCAC CAGGTGACAG GCGCCGCGGG CTGGCTCGCG CTCGCCGGCG AGTTGCTCGA TGTGGTCCGG GCGCGGTTCG CGGCGCCGGA CGGCGGTTAC TTCGACACCG CGGACGACGC CGAGGCGCTG CTGCGCCGGC CGCGGGACGC CTCCGACTCG GCGACCCCCT CGGGCCAGGC GGCCGTCGCC GGCGCCCTGC TGACCTACGC GGCGCTCACC GGCTCCGCCG ATCACCGGGA CAGCGCGCGG GCGACCGTCG AGCAGCTCAC CCCGCTGTTG AGCCGGGACG CCCGTTTCGC CGGCTGGGCG GGTGCCGTCG CGGAGGCCCT GCTGGCCGGG CCGGCCGAGG TCGCGGTGGT CGGCCGGCCG GATCTGGAGC GTCTGGCCAG GCTCGGCACC GCTCCCGGCG CGGTTGTCGT CACCGAGGGC CCGCTGACCG CGGGCCGGGA CGAGCCGGCC GTCTACATCT GCCGGGACTT CGTCTGCGAG CTCCCGGCGC GGACCCCGGA GGAGGTCCGC GCGCGGCTGG GAGTGCGGCT TCCAGCCTGA
|
Protein sequence | MPNKLAEQTS PYLLQHADNP VDWWPWGPEA FAEATTRGVP VLLSVGYAAC HWCHVMAHES FEDPEIAAYM NQHFVNIKVD REERPDVDSV YMDVTVALTG HGGWPMTVFL TPAAEPFFAG TYFPPRPMRG SASFPQVMAA IVDAWTARRA EVEQSGADIA RQLAEAVAPG GAASGGGATT QITADLLDRA VAGLADRFDS VHGGFGGAPK FPPSMVAEML LRSWARTGDG RALGMVRETC ERMARGGMYD QLGGGFARYS VDESWTVPHF EKMLYDNAQL LRVYLHLWRA TGLPLAERVV RETAAFLLAD LRTPEGGFAS ALDADAVPAG SPGGHPEEGA SYSWTPAQLV DVLGPDDGAL AARVLGVTAE GSFEHGTSVL MLPADPEDPA RFARVRAALA AARATRPQPA RDDKIVAAWN GLVIGALAEA GALLGEPSWV GAAERAAELL RDVHLHEGRL WRTSRDGRRG PNAGVLEDYG CVAEGFLTLH QVTGAAGWLA LAGELLDVVR ARFAAPDGGY FDTADDAEAL LRRPRDASDS ATPSGQAAVA GALLTYAALT GSADHRDSAR ATVEQLTPLL SRDARFAGWA GAVAEALLAG PAEVAVVGRP DLERLARLGT APGAVVVTEG PLTAGRDEPA VYICRDFVCE LPARTPEEVR ARLGVRLPA
|
| |