Gene Information |
Locus tag | Franean1_2201 |
Symbol | |
ID | 5670600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2633338 |
End bp | 2634351 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641241121 |
Product | hypothetical protein |
Protein accession | YP_001506542 |
Protein GI | 158314034 |
COG category | [S] Function unknown |
COG ID | [COG2135] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
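The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2633338, 2634351)
print(length_bp)                  # 1014
print(protein_length(length_bp))  # 337
```

Under these assumptions the computed values match the Gene Length (1014 bp) and Protein Length (337 aa) fields.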
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.229787 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGGCA GGTATACCCA GACTCTGAGC GCCGACGATC TCGCCGCGGC GATGTCCGCG GCGGACGAGA CGGGCGGCCG GGTGACGGAG AGCTACAACG TCGCCCCGAC CACGGTGATG CCGATCGTCG TCGCGCGTCG GCCGCCGGGC GACGCCGGGC ACGGTGAGCC GGCGGCTGGC GTTGTCACCG GCGGGGCCGG CGAAAGTGGC GAGGCTGGTG AGACAGGCGC GGGCGGCGGG CCGCGGGCCG CGGGCGGTGG GCGGGTCCTG CGGCTGGCCA CCTGGGGTCT GGTCCCGTTC TGGGCGAAGG ACCCGGCGAT CGGTAGCCGC CTGATCAACG CCCGCGCCGA GTCGGTCGCG TCCAAGCCGG CGTTTCGCGC AGCGTTCGCC GCCCGCCGGT GCCTCGTTCC CGCCACCGGT TTCTACGAGT GGCGGCGGCC CGGCGGCTCG CGCCGCGGCC AGCCCTACTA CATCCATCCG GCCGGCCACC CGGGGGCCGA CGGCCTGTTC GCCTTCGCCG GGCTCTACGA GGTCTGGTCG AAGGGCGAGC AGCCGCTGAC CACGTTCACG ATCCTCACCA CCGACGCGGC CGCGGGCATC GAGTTCATCC ACGACCGGTC ACCCGTCGTG GTGCCGCGCC CGGCCTGGTC CCGGTGGATC GACCCGACGC TGCGGGATCC CGAGGCGCTC GCCGGGATCC TGCGGCCCGC GCCGGCCGGG GTGTTCGCCG CACACCCGGT CTCACCCGAG GTCGGCAGCG TCCGCAACAC CGGCCGTCAC CTCGTCGACC CCGTCGACGT CGACCCGGAG GAGGCAGCCG CCGCTGCCGG TCCGCCGGCC ACCGCCGGCC CGTCCCGGGC CCGGCGGGCC GGGCGAGCGA CGCCCGTAGC CAAGTCAGCG TCCGTCGCCG GGCCGACCTC CGCCGCCGAG CCGCCGCCCG GTGTGGGGGT AATCGCCCCA CATGAGGTCG GCCTCTTCGG CGAGGATGGC GTGGGTCCGC ACGCGCGGCG ATGA
|
Protein sequence | MCGRYTQTLS ADDLAAAMSA ADETGGRVTE SYNVAPTTVM PIVVARRPPG DAGHGEPAAG VVTGGAGESG EAGETGAGGG PRAAGGGRVL RLATWGLVPF WAKDPAIGSR LINARAESVA SKPAFRAAFA ARRCLVPATG FYEWRRPGGS RRGQPYYIHP AGHPGADGLF AFAGLYEVWS KGEQPLTTFT ILTTDAAAGI EFIHDRSPVV VPRPAWSRWI DPTLRDPEAL AGILRPAPAG VFAAHPVSPE VGSVRNTGRH LVDPVDVDPE EAAAAAGPPA TAGPSRARRA GRATPVAKSA SVAGPTSAAE PPPGVGVIAP HEVGLFGEDG VGPHARR
|
| |
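The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes a GC fraction, and translates a coding sequence using the amino acid assignments of translation table 11, which are the same as those of the standard genetic code. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code; it differs only in
# which codons may act as start codons, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGTGCGGCA GGTATACCCA GACTCTGAGC"
print(translate(prefix))  # MCGRYTQTLS
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` and `translate` allows the GC content and Protein sequence fields to be checked in the same way.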