Gene Dshi_1437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1437 
Symbol 
ID5712614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1492683 
End bp1493738 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content64% 
IMG OID641267350 
Productglycine betaine/L-proline ABC transporter 
Protein accessionYP_001532780 
Protein GI159043986 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.102284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.596955 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGCG ATACCGTCAT CGAGATATCG AATGTCTGGA AGATTTTCGG CGCCAACGCC 
CAGGCAGCCC TTGAGGCGGT CCGCGACCGG GGGCTGAGCA AGGCCGAGAT CCTGGCCGAA
TTCAACGCGG TCGTGGGCGT GGCCGATGTC AGCCTGTCGG TGCGGCGCGG CGAGATCTTT
TGCATCATGG GGCTGTCGGG CAGCGGCAAG TCCACGCTGG TGCGCCATTT CAACCGCTTG
CTGGAGCCGA CCGCGGGCAG GATCGAGATC GAGGGGACCG ATGTCATGGC GCTCGGCACC
CAGGAGCTTC AACGCTTCCG CAACCGACAG ATCGGCATGG TGTTCCAGAA CTTCGCGCTG
ATGCCGCACC GTTCGGTGCT GGACAACGTG GCGATGCCAC TGGAGATCCG GAAGGTCCCC
AAGAACGAGC GCATGCGCCA GGCCGCCGCG ATCCTCGACA TCGTCGAGCT GGGCGCCTGG
GGGGCGAAGT TCGCCCATGA ACTGCCGGGC GGGATGCAGC AGCGGGTGGG GCTGGCCCGG
GCGCTGGCGG CGAATCCGGA CGTGTTGCTG ATGGACGAGC CCTTCTCGGC ACTCGATCCG
CTGATCCGAA GGCAGTTGCA GGACGAATTC ATCCGATTGT CGAAGATCCT CAAGAAAACC
ACGATATTCA TCACCCATGA CCTCGACGAG GCGGTGCGCA TCGGCGACCG GATCGCCATC
ATGCGCGACG GCAAGGTGGT GCAGATGGGC ACCGCCGAGG ACATCGTGAT GCACCCGGCC
GATGACTACG TGGCCGATTT CGTGGCCGGG ATCTCGCGGC TCAAGGTGGT TCATGCCCAC
GCGGTGATGC AGCCGCTGGA GGCCTATCTC GCCACTCACG GCCCGCTTCC GGCCGCCGTC
CCCAAGGTCG ACGAGGGCGA AACCCTGAGC AACCTGATCA CGCTCGCCAT CGATGACGAG
AATCCGATCC TCGTGCAGGA CGCGGGTCGG GACGTCGGTA TCATCACCCG TGCGGACCTG
TTGCGCACGG TCATCGAGGG AACGGAAGTC TCATGA
 
Protein sequence
MHGDTVIEIS NVWKIFGANA QAALEAVRDR GLSKAEILAE FNAVVGVADV SLSVRRGEIF 
CIMGLSGSGK STLVRHFNRL LEPTAGRIEI EGTDVMALGT QELQRFRNRQ IGMVFQNFAL
MPHRSVLDNV AMPLEIRKVP KNERMRQAAA ILDIVELGAW GAKFAHELPG GMQQRVGLAR
ALAANPDVLL MDEPFSALDP LIRRQLQDEF IRLSKILKKT TIFITHDLDE AVRIGDRIAI
MRDGKVVQMG TAEDIVMHPA DDYVADFVAG ISRLKVVHAH AVMQPLEAYL ATHGPLPAAV
PKVDEGETLS NLITLAIDDE NPILVQDAGR DVGIITRADL LRTVIEGTEV S