Gene Bind_2241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2241 
Symbol 
ID6200417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2572911 
End bp2574242 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content57% 
IMG OID641706229 
Productsodium:dicarboxylate symporter 
Protein accessionYP_001833347 
Protein GI182679201 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.896869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCGA TAAATCCGGA AAATCTCGCC AAGCCTCTCG GCGCGCCCCG CGCCGTGCCA 
TTCTACCGCA TCCTCTATGT CCAGGTGCTG ATCGGCATTT TGCTTGGCGT CCTGTTTGGT
TGGCTCTGGC CGGCGACGGC TTCCGCCGAT TGGGTGAAGG CGCTCGGCGA TGGCTTCATC
AAACTCATCA AGATGATGAT CGCGCCGATT ATCTTCTGCA CCGTGGTCTC CGGCATCGCG
CATGTCTCGA GCGCGCAAAA GGTCGGGCGC ATCGGTGTCA AGGCGCTCGT CTATTTCGAG
GTGGTCTCGA CCTTCGCTTT GGTTCTCGGA CTTGTGATCG GTAATCTCGT GCAGCCCGGC
CGCGGCATTG CCGACAAGGT CGATGCAGGA GCCGTCGCCA AATATGTGGG GCAGGAGCAT
TCGACAGTTC AGTTCCTGCT CGATCTCATT CCGGATAGCG TGGTCGGTGG TTTCGCGCGC
GGCGATGTGC TGCAAGTTCT TTTATTTTCC ATTCTCTTCG GCTTTGCCCT TCTGGGGCTT
GGATCTCGCG GCGAAAGCCT GACGAAGCTG ATCGACGATG TTGCACAGGC GGTTTTCGGC
GTCATCGCTA TCGTCATGCG GGCGGCGCCG ATTGGCGCCT TCGGTGCCAT GGCCTATACG
GTCGGCCGCT ATGGTCCGCA AACGCTCGGC AATCTCCTTG GTCTTGTCGC CACCTTCTAT
CTCACGGCGA TTTTGTTCAT CGTGATCGTC CTCGGGACGA TCGCCAGGCT CGCGGGTTTC
AGCATTTTCA AATTCCTTGC CTATATCAAA GATGAATTGC TGATCGTGCT TGGCACCAGT
TCTTCGGAAA GCGCCTTGCC GGCCTTGATG GAGAAACTCG AACGGCTCGG CTGTTCCAAA
CCCGTCGTCG GTCTCGTGGT GCCGACTGGC TATTCCTTCA ATCTGGACGG CACCAATATC
TATATGACAC TCGCGACCCT TTTTGTCGCG CAGGCGCTCG GCGTTCCCTT GAACCTTGAG
GAACAGATTA CGATTCTTTT GGTCGCCATG GTTGCCTCAA AAGGGGCGAG CGGTATTTCT
GGCGCAGGAT TCATCACGCT CGCGGCGACG CTCGCCGCCG TCAATCCTAT TCTTGTGCCG
GGCATGGCCA TGTTGCTCGG AGTCGATAAA TTCATGAGCG AATGCCGGGC CTTGACCAAT
ATTATCGGCA ATGGGGTGGC AACGATCATC GTCTCCCGCT GGGAAAAGGA GATCGACCCC
CGAGCGTTGC GCGCCGCATT GGAAAAGACC GTGGATACAA CCCATTTCAC GGCGGAGGAT
CAGCCGCTTT AG
 
Protein sequence
MVAINPENLA KPLGAPRAVP FYRILYVQVL IGILLGVLFG WLWPATASAD WVKALGDGFI 
KLIKMMIAPI IFCTVVSGIA HVSSAQKVGR IGVKALVYFE VVSTFALVLG LVIGNLVQPG
RGIADKVDAG AVAKYVGQEH STVQFLLDLI PDSVVGGFAR GDVLQVLLFS ILFGFALLGL
GSRGESLTKL IDDVAQAVFG VIAIVMRAAP IGAFGAMAYT VGRYGPQTLG NLLGLVATFY
LTAILFIVIV LGTIARLAGF SIFKFLAYIK DELLIVLGTS SSESALPALM EKLERLGCSK
PVVGLVVPTG YSFNLDGTNI YMTLATLFVA QALGVPLNLE EQITILLVAM VASKGASGIS
GAGFITLAAT LAAVNPILVP GMAMLLGVDK FMSECRALTN IIGNGVATII VSRWEKEIDP
RALRAALEKT VDTTHFTAED QPL