Gene Mmar10_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1070 
Symbol 
ID4284479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1166830 
End bp1168407 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content65% 
IMG OID638140542 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_756301 
Protein GI114569621 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.019749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGCA ATCTCGCCGA GCGCTGGGCT GAACTTCCGG ACCTTCTCGC CGGACACATG 
CTGTTGTCGC TGGCGGCCAT ACTGGTCGGC CTGTCGATCA GCCTGCCGCT TGGAATTCTT
GCGGCGTCCC GACCGCGCCT GGCGGCGGTC GTGCTCAATG CGGCCAGCGT CATCCAGACC
ATTCCCGGCC TTGCCCTGCT GGCATTGATG GTACCGCTTC TTGGCGGGAT GATCGGTTAT
GCGCCAGCCT TCCTGGCGCT GATGCTCTAT TCGATCCTGC CGATCCTGCG CAACACGATT
GTCGGGCTCC AGGGCCTTGA TCCGGTTGTG CGCGAGGCTG CCCGCGGCAT CGGCATGACG
CCGCGTGAAC GGCTGTGGCA GGTCGAGCTG CCGCTGGCCC TGCCGGTCAT TGTCGCCGGA
TTGCGTACGG CGGTGGTGTG GGTGGTTGGC GCGGCGACGC TGGCAACGCC GGTCGGGGCC
TCCAGCCTGG GCAATTACAT CTTTGCCGGC CTGCAGACCC GCAATTGGCT GTCCGTCCTG
TTCGGCTGCC TGTTTGCGGC CGGGCTGGCC ATTGCGCTGG ATCAGATCAT CCGCTTGCTG
GAAGTCGCCG CCCGGGATCG CCGCAAGGGG TTGGCGGTCG CTGTCGGCGT CGCGCTTTCG
GCACTGGTCG TCATCACCAT CGCACCGCGC TATGTGCCGG CCGCTGTGTT CCGAGGGGCG
GGGCCGGTCG TGGCGCTTCA GGACGGTGCC GACACCCTTG CCGATCAGAC TGTTGTGGTC
GGCGCCAAGG CCTTCACCGA GCAATACATT CTCGGTGCCC TGATGCAGAC GCGTCTGGAA
GAGGCCGGGG CCCGCGTCGA CCGGCTCGAC AATCTGGGTT CCACCATCGC CTTCGATGCC
CTGCGCACGG GCGAGATCGA CGTCTATATC GACTATACCG GCACAATCTG GGCGACGGTG
ATGAACCGTT CCGAGCCGAT CAACCGCAAT GCGATGCTCG CCGAGATGAC CGGCTGGCTG
TACGCCGAAC ACGGCATTCT GGCGCTTGGC GGGCTGGGAT TTGAAAACGC CTACGGGTTT
GCCATGGCCC GCGACCGGGC CGAAGCGCTG GGAGTCGACA GTCTCGCCGA CCTTGCTTCC
CAGGCCGAAG GCATGAGTGT CGGAGGCGAT GCGGAGGTCT TTTCGCGGCC CGAATGGGTG
AACACCCGCA ACCGCTACGG CCTCGGGGCG ATGCAAACCC GCGCCATGGA TGCCGCCTTC
ATGTATGGCG CTGTGCGGGA CGGCCAGGTC GACTTGATCT CTGCCTATAC GACTGACGGC
CGCATTGCGG CGTTTGACCT GCTGGTCCTC GCCGATCCAC TGGCGGTGTT GCCGCCCTAT
GACGCGGTGA TCCTTCTGTC GCCCGATGCG GCAGCCAATC CGGCCCTGGT CGAGGCGTTG
TCCGGCCTGA TTGGTGCGAT CGACAGTGAA GCGATGCGTG AGGCCAATCG CCTTGTCGAT
CTGGATCGTC AGACGCCGGA CCAGGCGGCA ATCTGGCTGT CGGACCATTT GCAGGCGGGT
GCGGGAGGCA ATCAATGA
 
Protein sequence
MSSNLAERWA ELPDLLAGHM LLSLAAILVG LSISLPLGIL AASRPRLAAV VLNAASVIQT 
IPGLALLALM VPLLGGMIGY APAFLALMLY SILPILRNTI VGLQGLDPVV REAARGIGMT
PRERLWQVEL PLALPVIVAG LRTAVVWVVG AATLATPVGA SSLGNYIFAG LQTRNWLSVL
FGCLFAAGLA IALDQIIRLL EVAARDRRKG LAVAVGVALS ALVVITIAPR YVPAAVFRGA
GPVVALQDGA DTLADQTVVV GAKAFTEQYI LGALMQTRLE EAGARVDRLD NLGSTIAFDA
LRTGEIDVYI DYTGTIWATV MNRSEPINRN AMLAEMTGWL YAEHGILALG GLGFENAYGF
AMARDRAEAL GVDSLADLAS QAEGMSVGGD AEVFSRPEWV NTRNRYGLGA MQTRAMDAAF
MYGAVRDGQV DLISAYTTDG RIAAFDLLVL ADPLAVLPPY DAVILLSPDA AANPALVEAL
SGLIGAIDSE AMREANRLVD LDRQTPDQAA IWLSDHLQAG AGGNQ