Gene Mmar10_1626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1626 
Symbol 
ID4284596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1781491 
End bp1782921 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content68% 
IMG OID638141113 
ProductXRE family transcriptional regulator 
Protein accessionYP_756856 
Protein GI114570176 
COG category[R] General function prediction only 
COG ID[COG3800] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.928013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.297966 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACCAGGCTTT CGAGAAGATC TTCGCCGGCG CCCGCCTGCG CCGCCTCAGG 
CGCGAACGTG GCATGACCCA GGCCGAGGCG GCCGAAGCCC TGGGCCTGTC GGCCAGCTAT
CTCAACTTGC TGGAACGCAA TCAGCGCCCG GTGACCGCGC GGGTGCTGCT GGCCCTGGCC
GAAGCCTTCG ACGTGGATGT CCGCAGCTTT GCCAATGAGA GCGATCGGCA ATTGATCGCC
GACCTCACCG AGGCGGCTGC CGACCCGGTC CTGGCCGGTC TCGAGCTCGA CCGCGTGGAA
CTCAACGAGC TCGCCGACAG CCAGCCGCGC GCCGCCGAGG CCCTGACCCG CCTGTTCCAG
TCCTATCGGG AAATGGCCAC AGCCACGGCC GATCTCGCCA CGCGCATGTC CGGCCCCGGC
GCGACAACGG GCGGCCCCGG TGTGGTGCTG GAATCCGTCC GCAACGCCAT CGACGCTCAC
CACAATCACT TCCCCGACCT CGAGGAAGCC GCCGAGGCCC TGAGCGACCG GGCCGGTCTG
CGCAGTCGCA ATCGCGACCA GGCGCTGGCA GCCTATCTGC AGGACCAGCA CGGCTTTACC
GTTCGCGTGC TGGACGAGGA CGTGATGGCC GGCGCCCGCC GCCGCCTGGA CTTTCACGGT
CGCCGCCTGC TGCTGTCGGA GACCCTGCCC CCCGCCTCAC GCGGCTTTCA CATGGCGGTC
GTGCTGGCGA GCCTGGAACA GGCCGATCTT CTCGACAGCC TGTGTGACCA GATCGATCTG
CCCAGCGCCG AGGGACGGCG CCTGCTCAGG ATCGGTCTCG CCAATTATTT CGCCGGCGCG
GTGCAGATGC CCTATGCCGC TTTCCACAGG GCCGCCGAGA CCAATCGCTA TCATCTCGGC
GTTCTGCAAC GCCGTTTCGA GGCCAGCTAT GAACAGGTCT GCCATCGCCT GACCACGCTG
CAGCGACCGG GCGCGCGCGG CCTGCCCTTC TTCATGATCC GGGTCGATGC GGCGGGCAAT
GTCTCCAAGC GCTTCGGCGG CGGCATCATG CCCTTTGCCC GCGCCGGCGG CGGCTGTCCG
AAATGGAATC TCTACGACGC GCTGCGGATG CCCGAGCGGA TCCTGACCCA GTCCTTCGAA
CTGCCCGACG GCACCCGCAT GCTGTCGCTC GCCCGTGGCC AGTCAACGCA AGGCCCCACA
GGACAGCCGC CCGTCCTGCA CGCGATCGCC CTGGGCTGTG ACTGGGACAA TGCCGGCAAG
ATCGCCCATG CCGACGGGAT GAGTGACGCC AACCCGGCCG CGATCGGTCT CGCCTGCCGC
CTGTGTGACC GCGAAGACTG CGCCCAGCGC GCCTTCCCGC CCCTCAACCG CAAGCTGACA
ATGGACCCGC ACCAGCTGCG GGCCTCGCCT TATGCGTTCG GGGAGAGTTG A
 
Protein sequence
MSDNQAFEKI FAGARLRRLR RERGMTQAEA AEALGLSASY LNLLERNQRP VTARVLLALA 
EAFDVDVRSF ANESDRQLIA DLTEAAADPV LAGLELDRVE LNELADSQPR AAEALTRLFQ
SYREMATATA DLATRMSGPG ATTGGPGVVL ESVRNAIDAH HNHFPDLEEA AEALSDRAGL
RSRNRDQALA AYLQDQHGFT VRVLDEDVMA GARRRLDFHG RRLLLSETLP PASRGFHMAV
VLASLEQADL LDSLCDQIDL PSAEGRRLLR IGLANYFAGA VQMPYAAFHR AAETNRYHLG
VLQRRFEASY EQVCHRLTTL QRPGARGLPF FMIRVDAAGN VSKRFGGGIM PFARAGGGCP
KWNLYDALRM PERILTQSFE LPDGTRMLSL ARGQSTQGPT GQPPVLHAIA LGCDWDNAGK
IAHADGMSDA NPAAIGLACR LCDREDCAQR AFPPLNRKLT MDPHQLRASP YAFGES