Gene Franean1_7032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7032 
Symbol 
ID5675343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8579736 
End bp8580827 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content66% 
IMG OID641245878 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001511269 
Protein GI158318761 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACC AGGAAGGGAT GCGGTGGACG GCATCCGGGC GGGTATCTGG ATCGCCTCGC 
CGTCGCGCCA TCGCAGCGCT GGTGCTGGTG CTCGCCGCTG CGCTGGTCCT GTTCCGGTGC
GGCGGCTCCT CGGTCCAGGT CACGGATGGC CGGCAGGGCT CCGGTGGCAA CCCGACTGTT
GGGCTGATCA CGAAGTCTTA CACTAACCCG TTCTTCGTGA AGATGCGCGA CGGCGCGCAG
CAGGCCGCGC GGGAGCAGAA GGTTGAGCTG TTGACCGCCA CCGGCAGGTT CGACGGCGAC
TATGCCAGTC AAGTCAGCGC CATCGAGAAC ATGGTAGCGG CCGGGGCGCG GGGCATCCTC
ATCACACCCA ATGACAGCAA GGCGATCGTC CCGGCGATCG AGCAGGCCCG GCACCGTGGT
GTTCTCGTCA TCGCTCTAGA CGTGCCCACC GACCCGGAGA GCGCCGTCGA CGCGCTGTTT
AGCACCGACA ACTTCAAGGC CGGCATACTG ATCGGCGAGT ACGCCAGGGC CGCTATGGGC
GACACGCCGG CCAGGATCGC AACCATGGAC GTCTCTTCGC ACATCACGGG CGGAGGCCTG
CTGCGACACA ACGGTTTCCT CGTCGGCTTC GGCGCCTTGG ACGTGACTGT CAGTGAGACT
CAGCAGGCCA CTCCGCCGAG CGTGGTGTGC AGCCGGGATT CCAAGGGTGA CCAGGCCAAG
GGGCGGACGG CGATGGCGGA CTGTCTGCGG ACGGACCCGG ACATCAACCT CGTGTACGCC
GTGAACGAAC CGGCCGCGTT CGGCGCGCGG ACCGCCCTGG ACGCGGCCGG AAAGGCAGAC
GTCATGATCG TCTCCATCGA CGGCGGATGC ACCGGCGTCC GGGCGGTCAG GGACGGCAAG
ATCGCTGCTA CCTCACAGCA GTACCCGCTG AAGATGGCCG AGCAGGGAGT GGCCGCCGTG
GTCGACTACG TCAAGGACGG AACGAAAGTA TCCGGATACG TCGACACCGG CACCACCCTG
ATCGCCGACG ATCGTCAGCC TGGAATCCCT TCGGAAGGCG TCGAGTACGG TCTGGCGAAT
TGCTGGGGCT GA
 
Protein sequence
MSDQEGMRWT ASGRVSGSPR RRAIAALVLV LAAALVLFRC GGSSVQVTDG RQGSGGNPTV 
GLITKSYTNP FFVKMRDGAQ QAAREQKVEL LTATGRFDGD YASQVSAIEN MVAAGARGIL
ITPNDSKAIV PAIEQARHRG VLVIALDVPT DPESAVDALF STDNFKAGIL IGEYARAAMG
DTPARIATMD VSSHITGGGL LRHNGFLVGF GALDVTVSET QQATPPSVVC SRDSKGDQAK
GRTAMADCLR TDPDINLVYA VNEPAAFGAR TALDAAGKAD VMIVSIDGGC TGVRAVRDGK
IAATSQQYPL KMAEQGVAAV VDYVKDGTKV SGYVDTGTTL IADDRQPGIP SEGVEYGLAN
CWG