Gene Information |
Locus tag | Franean1_2061 |
Symbol | |
ID | 5670462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2484335 |
End bp | 2485441 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641240983 |
Product | protein of unknown function UPF0052 and CofD |
Protein accession | YP_001506404 |
Protein GI | 158313896 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.282535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCCG ACGCTCCCGC GATCGTCGCG TTCGGCGGCG GGCACGGCCT GGCCGCGTCG CTGGCCGCGC TGCGCCGGAT CACCCGCCAC CTGACCGCGG TGGTTACCGT GGGTGACGAT GGTGGTTCGT CGGGCCGGCT GCGCGCCGAG CTCGGCGCCC TGCCCATGGG CGACCTGCGG ATGGCCCTCG CCGCGCTGGC CGGCGCCGAC GAGTGGTCCC AGACGTGGGC CGACCTCTTC CAGCACCGGT TCGGCGGCGG CGGGGCGCTC ACCGGCCACG CGGTCGGCAA CCTCGTGCTG ACGGCGCTGG CGGAGCGCGC GGGCTCCCCG GTGGCCGCGC TCGACCTGGC GGCGTCGCTG CTCGGCGTCG ACGGCCGGGT GCTGCCACTG TCATGCGCCG GCATCGACAT CGTCGCGGAC GTGACCGGCC TCGACCCGAA CCGGGCCGGG GAGTCCGCGG AGGTCCGCGG CCAGGCGGCC GTCGCGACGA CCCCCGGCCG GGTGGCCGGG GTACGCCTGG CCCCGGCGGA GCCGGCGGCC TGCGGTGCCG CGCTCGCGGC CGCCGCCGCC GCCGACTGGA TCGTCCTCGG TCCGGGCTCG CTCTACACGA GCGTGCTGCC GCACCTGCTC GTGCCCGACA TGCGCGCGGC GATCACGGGC GCCGACGCCC GCCGGGTGAT GGTGCTCAAC CTCGTGGCGC AGCCGGGCGA GACGGCCGGC TACACCCCCG AGGCCCACCT GCACGCCCTG GCCACGCACG TCCCGGGGCT GCGCCTCGAC GTGGTGATCG CGGACCCGGC CGCGGTCGGC GACCCGGACC CGCTGGCCCG CGCGGCCGCG GACCTGGGCG CCCGGCTCCA CCTCGCACCC GTGCGGGTTC CCGGTGAACC CGCACTGCAC GACCCCGAAC GTCTCGCGGC CGCGTTCCGC GCCGTCTTCG CGCAGGACGG CGCCGTGGCG GTGCCGTCGG CGGCCCGTCC GATGGACGGC CAGCGGGCCT GGCCTGGCCG GCAGCCGGGA GCGGCCTGTG CCGACCCCTC GGGTGTCGGG TGCTCATCCA CCGGTGGCTC CGGCCACCGG ATCCCCTCCG GTGAATGCAA GGAGTGA
|
Protein sequence | MTADAPAIVA FGGGHGLAAS LAALRRITRH LTAVVTVGDD GGSSGRLRAE LGALPMGDLR MALAALAGAD EWSQTWADLF QHRFGGGGAL TGHAVGNLVL TALAERAGSP VAALDLAASL LGVDGRVLPL SCAGIDIVAD VTGLDPNRAG ESAEVRGQAA VATTPGRVAG VRLAPAEPAA CGAALAAAAA ADWIVLGPGS LYTSVLPHLL VPDMRAAITG ADARRVMVLN LVAQPGETAG YTPEAHLHAL ATHVPGLRLD VVIADPAAVG DPDPLARAAA DLGARLHLAP VRVPGEPALH DPERLAAAFR AVFAQDGAVA VPSAARPMDG QRAWPGRQPG AACADPSGVG CSSTGGSGHR IPSGECKE
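The fields above can be cross-checked against one another. The sketch below is only an illustration, not part of the record or of any database tool. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the gene sequence ends in a stop codon, which matches the trailing TGA shown here. All function names are hypothetical. The sequences are displayed in space-separated blocks of ten, so they are rejoined before counting. Under translation table 11, GTG can act as an initiation codon and is then read as methionine, which is why the gene begins with GTG while the protein begins with M.

```python
def join_blocks(text: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in dna (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # Coordinates and lengths taken from the Gene Information table.
    print(span_length(2484335, 2485441))   # 1107
    print(expected_protein_length(1107))   # 368
    # First two display blocks of the gene sequence only.
    print(gc_fraction(join_blocks("GTGACCGCCG ACGCTCCCGC")))
```

Passing the full Gene sequence value through `join_blocks` and then `gc_fraction` gives a figure that can be compared with the GC content field. The result is not stated here because it has not been computed for this record.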