Gene Dshi_2119 details


Gene Information

Locus tag: Dshi_2119
Symbol:
ID: 5713115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2249037
End bp: 2250611
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 63%
IMG OID: 641268041
Product: betaine/carnitine/choline transporter family protein
Protein accession: YP_001533456
Protein GI: 159044662
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
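
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2249037
end_bp = 2250611

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1575
print(protein_length)   # 524
```

Both results agree with the Gene Length and Protein Length fields of the record.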


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGATCC GATCAGGACC GTTCAGGGGG GTGAATCCAC ACACATCCCT GATATCGGCA 
GGGATAATCC TGCTTTTCGT GTTGCTTGTG CTGGCGTTTC CAAGCCGTTC GGCCGACTGG
ATCGACGCCG CCAGAACCCT TATCACGTTC TATTTCGGCT GGTGGTACGT GCTGCTTTCG
GGCGTGTTTC TCGTATTCCT GATCGCAATC GCGTTCAGCA AATACGGCGC GATCCGGCTC
GGAGACGCGG ACGAGCGCCC GAAATACAGC TACTTTACCT GGTTCGCGAT GCTCTATGCC
GCGGGGCAGG GGATCGGGAT CATCTTCTGG TCCATCGCCG AGCCGATGTT CCACTACTCG
GGCGGCACGC CCTTCGCCGA TGGCAGCGGC ACCGCGGCTG CGGCGGACAT GGCGATGCAG
GTGACCTTCT TTCACTGGGG CCTGAACGCC TGGGCGATCT ACTGCATCGT GGCCCTGGCC
TTGTGTCTGG TCAGCTACCG GCTGAAGAAG CCGCTCGGCA TCCGCTATAC CCTCTATCCG
TTGTTCGGCG ACCGTGTCGA AGGCCCGCTG GGCGTGGTGA TCGACGTGGT GGCGGTGTTC
GCAACCATCT TCGGCATCGC GACCTCCCTT GGCCTGGGGG TCACGCAGAT CAACGCCGGC
CTCAACCACC TGTGGGGCGT TCCGATCTCC GAGACGGTCC AGCTTGTGCT GATTGCCGCG
ATCACGGCTG TGGCGCTGTG CTCGGTCCTG TCGGGCCTTG ACCGGGGCAT TAAGTGGCTG
TCGCAGGTCA ACATGTGGCT GACGATCGCC CTGCTGGTCT TCTTCTTCAC CTGGGGGCCG
ACCCAATACC TGCTGGTGAG CCTGGGCGAG GTGACGCTGG CTTACTTTGT CAGCCTCTTC
TCCTTCAACG TCTACATCGA AAGCGTGCCC GCCGAGGCAA CCCGTTGGAG CGACATGTGG
CAGGGCTGGT GGACGACCTT CTACTGGGGT TGGTGGATTT CCTGGGCGCC CTTCGTGGGG
GTGTTCGTGG CGCGCGTGTC GCGGGGCCGG ACGGTGCGGG AATTCATCCT TGGCGTCGTG
GGCGTGTCCT CGATCCTGTC CTTCGTCTGG ATCGTGGCCT ATGGCGGCAC CGCGCTCTGG
GCCGAGGTTC TGGGGCCGGG GGGCGTGTCC GATGCGGTGA GCGCGAATGT CTCCATGGCG
CTCTTTGCCA CCTTCGATGC GATGGATGTG GGGGCCATCG GCTTGGTCGC CGGCGTGTTC
GGCACGATCC TCGTGACGAC CTATTTCGTG ACCTCCTCGG ATTCCGGCAC GCTCGTTGTC
GCCACGATCC TCAGCGAGGG CAACGAGCAT CCGATGTATC GCCACCGCAT GATCTGGGGC
ACGTTCGAAG GTGTCGTCGC CGCCGTCCTG CTGGTCGTTG GCGGGAGTGC CGCGCTCAGC
ACGCTGCAGA CCGCGGCGAT CATCGCCGCA CTGCCGTTCT CGGTGATCAT GGTGCTTATG
TGTTTCGCGA TCATCCGCTG CCTTGCGCTC GAACATGGCA AAGAGGCGAT CAGATCCGCC
GACAAGACCG CCTGA
 
Protein sequence
MLIRSGPFRG VNPHTSLISA GIILLFVLLV LAFPSRSADW IDAARTLITF YFGWWYVLLS 
GVFLVFLIAI AFSKYGAIRL GDADERPKYS YFTWFAMLYA AGQGIGIIFW SIAEPMFHYS
GGTPFADGSG TAAAADMAMQ VTFFHWGLNA WAIYCIVALA LCLVSYRLKK PLGIRYTLYP
LFGDRVEGPL GVVIDVVAVF ATIFGIATSL GLGVTQINAG LNHLWGVPIS ETVQLVLIAA
ITAVALCSVL SGLDRGIKWL SQVNMWLTIA LLVFFFTWGP TQYLLVSLGE VTLAYFVSLF
SFNVYIESVP AEATRWSDMW QGWWTTFYWG WWISWAPFVG VFVARVSRGR TVREFILGVV
GVSSILSFVW IVAYGGTALW AEVLGPGGVS DAVSANVSMA LFATFDAMDV GAIGLVAGVF
GTILVTTYFV TSSDSGTLVV ATILSEGNEH PMYRHRMIWG TFEGVVAAVL LVVGGSAALS
TLQTAAIIAA LPFSVIMVLM CFAIIRCLAL EHGKEAIRSA DKTA
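
The gene and protein sequences above can be cross-checked against each other. The sketch below is a hedged illustration, not the database's own method: it builds a codon table whose amino acid assignments match translation table 11 (which differ from the standard code only in the permitted start codons, not handled here), strips the spaces used to group the sequence into blocks of ten, and translates the first and last bases of the gene. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments match table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq.split()).upper()

def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 21 and last 15 bases of the gene sequence listed above.
first_bases = "ATGTTGATCC GATCAGGACC G"
last_bases = "GACAAGACCG CCTGA"

print(translate(first_bases))  # MLIRSGP
print(translate(last_bases))   # DKTA*
```

The two translated fragments match the start (MLIRSGP) and end (DKTA) of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.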