Gene Mmar10_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1642 
Symbol 
ID4284228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1800098 
End bp1801036 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content63% 
IMG OID638141129 
Productbile acid:sodium symporter 
Protein accessionYP_756872 
Protein GI114570192 
COG category[R] General function prediction only 
COG ID[COG0385] Predicted Na+-dependent transporter 
TIGRFAM ID[TIGR00841] bile acid transporter 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0689619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.549741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCA GCGCGATCGA CCAATTGCAG GTAATAGTTG ATGACCAATC GCGGTTGGGG 
ATCGCCGCGA TTCTCTTTGT GATGATGTTC TCGGTCGCGT TGACGCTGAG GCTGGAAGAC
TTCGCATTGA TACGGCGCCA ACCTGGTCGC GTGCTGGGTG GGGTCGCGGT CCAGCTGGTT
GGCCTGCCCC TGCTGACCCT TGGCCTTATC CTCCTCTTGT CGCCGCCGGC CAGCATCGCC
CTGGGCATGT TGATCGTGGC GAGCTGTCCG GGCGGGAATG TCTCCAACCT GCTGACCCGC
GCGGCCGCCG GGAATACGGC CTACTCGGTC ACCTTGACGG CCATTTCCAG CGTCTCTTCA
GCGATCATGA CCCCGCTTTC AATCCTGTTC TGGTCCGGGC TCTATGCGCC GGCCGGTGCG
CTGGTCCGAT CGCTGGACGT CGACCCTTTG CCCTTTTTTG CGCAGACGGC GGTTCTGCTC
GCAGTTCCCC TGATCCTGGG CATGGCGCTC AATCAGTGGC GTCCAGCCCT GGCGGGTCGC
CTGGCAGCGG TCCTCGGTCC CTTGGCGCTG GCCTGTATCG CGCTGCTGGT CGTCGTCGGC
ATCGTCCAGA ACTGGGCCCT GATCCTTGCC ACCGGTGCTA TCATCATCCC CATCGTCGTT
CTCCATAATG GGTCTGCCTT CGCGCTTGGC TGGCTGGGCG GGCGTGCCAT GGGCATGGAA
GCGGCCCGCC GCCGGGCCCT GACCTTTGAA GTCGGCATCC AGAATTCCGG GCTTGGCCTG
GTCATCCTCC TGAGCCAGTT CGAAGGTGTC GGCGGTGCCG CGGCCATTAT CGGCACCTGG
AGCATCTGGC ACCTGGTGGG TGGATCACTG GTTGCGGGGT TGTTCCGCTG GATGGATTCA
CGGACACTCC TCGCACACGC CAGAGAGCGT GACTCATAA
 
Protein sequence
MDASAIDQLQ VIVDDQSRLG IAAILFVMMF SVALTLRLED FALIRRQPGR VLGGVAVQLV 
GLPLLTLGLI LLLSPPASIA LGMLIVASCP GGNVSNLLTR AAAGNTAYSV TLTAISSVSS
AIMTPLSILF WSGLYAPAGA LVRSLDVDPL PFFAQTAVLL AVPLILGMAL NQWRPALAGR
LAAVLGPLAL ACIALLVVVG IVQNWALILA TGAIIIPIVV LHNGSAFALG WLGGRAMGME
AARRRALTFE VGIQNSGLGL VILLSQFEGV GGAAAIIGTW SIWHLVGGSL VAGLFRWMDS
RTLLAHARER DS