Gene Information |
Locus tag | Franean1_7032 |
Symbol | |
ID | 5675343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8579736 |
End bp | 8580827 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245878 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001511269 |
Protein GI | 158318761 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
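The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(8579736, 8580827)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both results match the Gene Length and Protein Length rows of the record.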
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGACC AGGAAGGGAT GCGGTGGACG GCATCCGGGC GGGTATCTGG ATCGCCTCGC CGTCGCGCCA TCGCAGCGCT GGTGCTGGTG CTCGCCGCTG CGCTGGTCCT GTTCCGGTGC GGCGGCTCCT CGGTCCAGGT CACGGATGGC CGGCAGGGCT CCGGTGGCAA CCCGACTGTT GGGCTGATCA CGAAGTCTTA CACTAACCCG TTCTTCGTGA AGATGCGCGA CGGCGCGCAG CAGGCCGCGC GGGAGCAGAA GGTTGAGCTG TTGACCGCCA CCGGCAGGTT CGACGGCGAC TATGCCAGTC AAGTCAGCGC CATCGAGAAC ATGGTAGCGG CCGGGGCGCG GGGCATCCTC ATCACACCCA ATGACAGCAA GGCGATCGTC CCGGCGATCG AGCAGGCCCG GCACCGTGGT GTTCTCGTCA TCGCTCTAGA CGTGCCCACC GACCCGGAGA GCGCCGTCGA CGCGCTGTTT AGCACCGACA ACTTCAAGGC CGGCATACTG ATCGGCGAGT ACGCCAGGGC CGCTATGGGC GACACGCCGG CCAGGATCGC AACCATGGAC GTCTCTTCGC ACATCACGGG CGGAGGCCTG CTGCGACACA ACGGTTTCCT CGTCGGCTTC GGCGCCTTGG ACGTGACTGT CAGTGAGACT CAGCAGGCCA CTCCGCCGAG CGTGGTGTGC AGCCGGGATT CCAAGGGTGA CCAGGCCAAG GGGCGGACGG CGATGGCGGA CTGTCTGCGG ACGGACCCGG ACATCAACCT CGTGTACGCC GTGAACGAAC CGGCCGCGTT CGGCGCGCGG ACCGCCCTGG ACGCGGCCGG AAAGGCAGAC GTCATGATCG TCTCCATCGA CGGCGGATGC ACCGGCGTCC GGGCGGTCAG GGACGGCAAG ATCGCTGCTA CCTCACAGCA GTACCCGCTG AAGATGGCCG AGCAGGGAGT GGCCGCCGTG GTCGACTACG TCAAGGACGG AACGAAAGTA TCCGGATACG TCGACACCGG CACCACCCTG ATCGCCGACG ATCGTCAGCC TGGAATCCCT TCGGAAGGCG TCGAGTACGG TCTGGCGAAT TGCTGGGGCT GA
|
Protein sequence | MSDQEGMRWT ASGRVSGSPR RRAIAALVLV LAAALVLFRC GGSSVQVTDG RQGSGGNPTV GLITKSYTNP FFVKMRDGAQ QAAREQKVEL LTATGRFDGD YASQVSAIEN MVAAGARGIL ITPNDSKAIV PAIEQARHRG VLVIALDVPT DPESAVDALF STDNFKAGIL IGEYARAAMG DTPARIATMD VSSHITGGGL LRHNGFLVGF GALDVTVSET QQATPPSVVC SRDSKGDQAK GRTAMADCLR TDPDINLVYA VNEPAAFGAR TALDAAGKAD VMIVSIDGGC TGVRAVRDGK IAATSQQYPL KMAEQGVAAV VDYVKDGTKV SGYVDTGTTL IADDRQPGIP SEGVEYGLAN CWG
|
| |
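As a rough illustration of how the gene and protein sequences relate, the sketch below translates a DNA string using the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). It assumes the gene sequence is listed 5' to 3' on the coding strand, which is consistent with its leading ATG even though the gene lies on the minus strand. The helper names `clean`, `translate` and `gc_percent` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence listed above.
print(translate("ATGTCGGACC AGGAAGGGAT GCGGTGGACG"))  # MSDQEGMRWT
print(translate("TGCTGGGGCT GA"))                     # CWG
```

The two outputs match the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content row.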