Gene Mmar10_0441 details


Gene Information

Locus tag: Mmar10_0441
Symbol:
ID: 4285054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 518605
End bp: 520188
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 64%
IMG OID: 638139904
Product: hypothetical protein
Protein accession: YP_755672
Protein GI: 114568992
COG category: [S] Function unknown
COG ID: [COG1322] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
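
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
START_BP = 518605
END_BP = 520188


def gene_length(start, end):
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1584
print(protein_length(length_bp))  # 527
```

Both results match the Gene Length and Protein Length fields listed in the record.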


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.870826
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGGTC CATGTGCCAG CCTGGTGGCC GTGCGGGGCG GATGGCTGGC CGCACCGGGG 
CGACTCGTCC AGAATGACAG CGAACAAGAA GGCGATTGCA TCGTGACTTT CTCAGACATC
TCCCCGGAAA CCTGGCTGAT CCTGGCCGGC ATGGTCGTCC TGCTTGCAGT GCTCGCTGGC
GGCCTGTTCC TGCAACGAAA GCCCGACCCG TCCGGCACGA CCGATAACCC GGAAGTGGAA
CGGCTGCGCA GCGAACGCGA CGACTGGCGC AACCGTCAGG AAAGTGCTGC CTCCGAACTG
GCGACGGCGC GCGAGGCGCT GGCCCGGCTT GATGGTGTGG TCGAGGAGCG CGACCGCCTG
CGGGTCTCGC TGGAAGCGGT CACGGTCGAG CGCAACGGGC TCAATGCCAG CCATGAAGCG
CTCAAGACCG AGCATGACGC GGCCCTGCGC CATCATGATG AAAAGCTGGC CGAGCTGGAA
AAGGCCCGCG AGCGTCTCAA CGACCAGTTC AAGCTGACCG CCGCGGAAAT CCTGAAAGCC
TCGGGTGCCG AGCTGAACAA GCAGAGCACG GAAAGCCTGC AAACCCTGCT CAAGCCGCTG
CGCGAACAAC TCACCGATTT CCGCTCGAAA GTGGAGAAGG ATGCCGAGCA ACGCCACGGT
CATGCCGGCG AGATCAAGCA ATTGATGGAG ACCGTGCGCA AGGACGCCTC GCGGATGTCG
GACGACGCCC AGAAGCTGGC CAATGCGCTG CGCTCCTCCT CCAAGGTGCA GGGCGATTGG
GGCGAGATGG TGCTGGCCTC GATCCTCGAA CGCGCCGGCC TGCTGGAGAA TGAGCACTTC
CACACCCAGT CCACCGAGCG CAATGCCGAG GGTGCGCTTT TGCGTCCCGA CGTGATCGTC
GACATGCCGG GCGGGCATCG CCTGGTGATC GACAGCAAGG TCTCGCTGGT CGCGTTCGAG
CGCTGCGTGA ATGCCGAGGA TGACGAGATC CGGACGGCAG CGCTGAAACA GCACATCGCC
TCGGTCCGCG CCCATATCAA AGCATTGGGC GAGAAGGATT ATGCCAAGCT CTATGAGGGC
GTGAACTTCA CCTTGATGTT CATTCCGCTG GAAGGCGCGG CCTCGCTGGC GCTGCAGAAT
GACCCGGAAT TGTCGGCCTA TGCCTGGGAC CGCAATGTGA TGATCGCGAC ACCCACGACC
TTGATGATGG CCATGCGCAC GGTTGGAAAC CTGTGGACGA TCGACCGCCA GAACCGGCAC
GCCATCGACA TCGCCGATCG TGCCGGCGCC CTCTATGACA AGTTTGAAGG CTTCGTCGGC
GATCTCGAGA CGGTCGGCAA GCGCATCGAC GGGGCCAAGG ACGCCTGGGC GGCGGCCAAG
GGCAAGCTGG TTGAGGGTCG TGGCAATCTG GTGCGCCAGA CCGAAATGCT CAAATCTCTG
GGCGCCAATG CCCGCAAATC ACTGCCGGCC GAGTATCTCG ATGCGGCGGG CGCCGATGAA
GCGACCGAGA GCGGCGGTGG CAAATCCGCC CCCAAGCCAG CCCCCAAGCC AGCCCTCGCA
GCACCGGAAG CGGTCGATTC TTGA
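
The gene sequence is printed in blocks of 10 bases, 60 per row, so it has to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a block-formatted sequence; the helper names are hypothetical, and only the first row of the sequence is used here as sample input, so the printed percentage applies to that row alone and not to the whole gene.

```python
def clean_sequence(text):
    """Join a block-formatted sequence (spaces, line breaks) into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTCGGGTC CATGTGCCAG CCTGGTGGCC GTGCGGGGCG GATGGCTGGC CGCACCGGGG"
row_seq = clean_sequence(first_row)
print(len(row_seq))  # 60
print(round(gc_percent(row_seq), 1))
```

Passing the full 1584 bp sequence to `gc_percent` in the same way gives the value to compare against the GC content field.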
 
Protein sequence
MSGPCASLVA VRGGWLAAPG RLVQNDSEQE GDCIVTFSDI SPETWLILAG MVVLLAVLAG 
GLFLQRKPDP SGTTDNPEVE RLRSERDDWR NRQESAASEL ATAREALARL DGVVEERDRL
RVSLEAVTVE RNGLNASHEA LKTEHDAALR HHDEKLAELE KARERLNDQF KLTAAEILKA
SGAELNKQST ESLQTLLKPL REQLTDFRSK VEKDAEQRHG HAGEIKQLME TVRKDASRMS
DDAQKLANAL RSSSKVQGDW GEMVLASILE RAGLLENEHF HTQSTERNAE GALLRPDVIV
DMPGGHRLVI DSKVSLVAFE RCVNAEDDEI RTAALKQHIA SVRAHIKALG EKDYAKLYEG
VNFTLMFIPL EGAASLALQN DPELSAYAWD RNVMIATPTT LMMAMRTVGN LWTIDRQNRH
AIDIADRAGA LYDKFEGFVG DLETVGKRID GAKDAWAAAK GKLVEGRGNL VRQTEMLKSL
GANARKSLPA EYLDAAGADE ATESGGGKSA PKPAPKPALA APEAVDS
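
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table for NCBI translation table 11, whose amino-acid assignments for internal codons are the same as the standard code; the two tables differ only in which codons are accepted as starts, which does not matter here because this gene begins with ATG. The `translate` helper is illustrative; a library such as Biopython would normally be used instead.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases and last 15 bases of the gene sequence, copied from the record.
print(translate("ATGTCGGGTC CATGTGCC"))  # MSGPCA
print(translate("GCGGTCGATTCTTGA"))      # AVDS
```

The two outputs match the first six and last four residues of the protein sequence; the final TGA codon is the stop and contributes no residue.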