Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3943 |
Symbol | |
ID | 5672304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4712443 |
End bp | 4713654 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641242822 |
Product | ABC transporter related |
Protein accession | YP_001508239 |
Protein GI | 158315731 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3839] ABC-type sugar transport systems, ATPase components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.39994 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0172807 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCAGGA TCGAGCTCGA CGGGCTGACG AAGAGATACG GCGACGTCGT CGCCGTGGAC GGCGTCAGCC TCGACATCGC CGACGGCGAG TTCCTGGTGC TGCTCGGCCC GAGCGGCTGC GGGAAGTCCA CCCTGCTGCG GCTCGTCGCC GGCCTGATCA CACCGTCCGG CGGCCGGGTG CTGCTCGACG GCCGGGACAT CACCCACGCC CCACCCGCCC GGCGCGACCT GGCGATGGTC TTCCAGAGCT ACGCCCTGTA CCCGCACCTG ACCGTGGCCC GCAACATCGG CTTCCCGCTG CGCGCGGCCC GCACGCCCCG CGCCGAGGTC CGCCGCCGGG TCGAGGAGGT CGCCGCGCTG CTCGAGCTCG GTCCGCTGCT CGACCGCCGG CCGCGGGAGC TCTCCGGCGG CCAGCGCCAG CGGGTCGCCG TCGGCCGGGC GATCATCCGC GACCCGCGGG CGTTCCTGAT GGACGAGCCG CTGTCGAACC TGGACGCCAA GCTGCGCCAG GCGACCCGGG CCCAGTTCCG GATGCTGCAC GAGACCCTCG GCAGCACCGT CCTCTACGTC ACCCACGACC AAGTCGAGGC GCTCAGCCTC GCGACCCGGA TCGCGATGCT CGACGGCGGG CGCCTCGAGC AGCTCGGCAC GCCGACCGAG GTCTACGACA GCCCGGCCTC GGTGTTCGTC GCTGGCTTCC TCGGCTCCCC GCCGATGAAC CTGCTGCCCG CCCGAGTGGA GTGCCACGAC GGCCGGGTGC GGGTGCTCGC CGACGACGTC GAGGTCGACC TGTGGCCCGG CGAGGACGTC GAGGCCCGTG ACGTGATCCT CGGGATCCGG CCCGAACACC TCCACCTGAT GGCCGCCGAG GGCCCGGCGG GCCCGGGCGG GATGCCGCGG CTGCGCGGTG TCGTGCGGGC GGTGGAGAAC CTCGGCGCCG AGGAGTCCGC GCAGTGCGCG GTCGGCGGCG CCCTCGTGCA CTTCCGGGGC GCCCGCCCCC TCGGGCTGGC CGCCGGCCAG CCCGTCGCGC TCACCACGGC GCGCGACCAC ATCCACCTGT TCGACCGGCA CAGCGGCCGG CGGCTCGCCT GGCGCCCGCT GGCCGACCGC CCGCTGGCCG ACCGCCAGTC GGCCGACCAC GAGCCGGCCG ACCACGAGCC GGCCGCGGCA CTCCACGACG GCGTGACCAC TCATCAAAGG AGCTCGACGT GA
|
Protein sequence | MGRIELDGLT KRYGDVVAVD GVSLDIADGE FLVLLGPSGC GKSTLLRLVA GLITPSGGRV LLDGRDITHA PPARRDLAMV FQSYALYPHL TVARNIGFPL RAARTPRAEV RRRVEEVAAL LELGPLLDRR PRELSGGQRQ RVAVGRAIIR DPRAFLMDEP LSNLDAKLRQ ATRAQFRMLH ETLGSTVLYV THDQVEALSL ATRIAMLDGG RLEQLGTPTE VYDSPASVFV AGFLGSPPMN LLPARVECHD GRVRVLADDV EVDLWPGEDV EARDVILGIR PEHLHLMAAE GPAGPGGMPR LRGVVRAVEN LGAEESAQCA VGGALVHFRG ARPLGLAAGQ PVALTTARDH IHLFDRHSGR RLAWRPLADR PLADRQSADH EPADHEPAAA LHDGVTTHQR SST
|
| |