Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1204 |
Symbol | |
ID | 5669617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1439371 |
End bp | 1440387 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240136 |
Product | hypothetical protein |
Protein accession | YP_001505564 |
Protein GI | 158313056 |
COG category | [S] Function unknown |
COG ID | [COG1426] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00102705 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.560918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTCAG CGGACGAACC GGCGACCCCC GAGTCGGCGA GTCCCGGGTC GGCGGGCACC GCGCCGGCGA GCCCCGGGCC GGAGGTTCTC GAACCGGTGA CCGCGAAGCC GGCGGCCCCG GGCCCCGCGG CCCAGATCCC GGCGCCCCAG CAACCGGTGC CTCCGGACAA GTCGGCTCCG GGCGGGGCGG CTCAGGAGTC CCTGGGCTCG GTCATCGCCG CGGCGCGCCG GGCCGCGGGC CTGACGATCG ACGACGTGAG CGACCGGACG AGAATCCGCG CCTCGCTGAT CGAGCGGATC GAGCAGGACG ACTTCTCCGG CTGCGGCGGC TCCGTCTACG CTCGCGGGCA CCTGCGCAGC ATCGCGACGA CGCTCGGGCT CGAGCCCGGC CCCCTGCTGG CGGTGTACGA CGCCGGGCAC GAGCACGTGC CGTCCCCGGT CGTGGTCGCC TCGCCGGAGT TCGACCCGCT GCACGGTGGC GCGGGGCGTA ACCGCGGCCT TGGCGGCTTC CGCTGGGCAC CGGCAATGAT CATCTCGCTG GTGGTGGTGT GCGCGCTGGC GCTCGTCGCG CTGCTGCTGC CCTCGGGCGG GGGCGACTCG GACGACTCGG CCACGCCCCG GCCGAGCGCG CCGCCGAGCG CCCCGGCCGC CACGGCACCC CCCGGCCCGG CGCCGGCGCC CACCACCCCG CCGCCCCCCG GGGTGAACGT GCGGGTGGAG GCCCGCGACG CGCAGAGCTG GCTGGAGGTC CGCGACGACA GCGACAAGGT GCTGTTCGCG CAGCTCCTGC AGCGGGGCGA CAGCCGTGAG GTGTCCTCCG AGGGCGCCCT GGAGATCAAG ATGGGCAACG CGGGCGCCGT CGACCTCTCC TGCAACGGCA CGAGCCTGGA CCGGGCCGGC GGGCCGGGCG AGGTCGTGAC GATCCGGCTA GCGCTCGCCG CGACCGGCGG TGGCTGCACG GTCGGCGGGC CGGGCACGGG CGGCCTCGCG GCCGGCGGGC TGGCGGGCTG GCGATGA
|
Protein sequence | MQSADEPATP ESASPGSAGT APASPGPEVL EPVTAKPAAP GPAAQIPAPQ QPVPPDKSAP GGAAQESLGS VIAAARRAAG LTIDDVSDRT RIRASLIERI EQDDFSGCGG SVYARGHLRS IATTLGLEPG PLLAVYDAGH EHVPSPVVVA SPEFDPLHGG AGRNRGLGGF RWAPAMIISL VVVCALALVA LLLPSGGGDS DDSATPRPSA PPSAPAATAP PGPAPAPTTP PPPGVNVRVE ARDAQSWLEV RDDSDKVLFA QLLQRGDSRE VSSEGALEIK MGNAGAVDLS CNGTSLDRAG GPGEVVTIRL ALAATGGGCT VGGPGTGGLA AGGLAGWR
|
| |