Gene Mmar10_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_1404 
Symbol 
ID4284636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp1539658 
End bp1541595 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content62% 
IMG OID638140886 
Producthypothetical protein 
Protein accessionYP_756634 
Protein GI114569954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.15542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.00031437 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTTCCA ATTTTCGTGC TTTCGCCCAG TCGCCCATAG CGCTGATCAT CATTGTCCTG 
CTGGTATTGT CTTTCGCGAT TGCCATGCCG GGCGCCGGCG GCATCTTCAC CGGCAGCGGT
GATGCAGTGG TCGTGGTTGG CCCCGAACGG ATCAGCCAGC GGGAAATGGC GACAGCCTTC
AATCGCGAGG TTGCCCGCCT GCAAGAGCAG AACCCGGACG TGACCCGCGA AATGGCGCGC
GAGGAAGGGA TCGCCTACCA GGTCCTGCAA CAGCAGATCA CCATGGCAAC CATGGCCGCC
CGGGCCCGGG ATCTGGGCCT GGCCATATCC GATGTCGCCA TCGTCAGCGA AGTTGCCGAC
GTCCCGGCCT TCCAGAATCC GATCACGGAA CGCTTCGATC GTGACACCAT GGCGGCCGCG
CTGCAGCGCT CGGGCGTGAC CGAAGACCAA TTCGCCCGTG ATATCGAAGG TGACCTGTTG
CGGTCGCAAT TGATGGCCAC CCTGGCCGGC TTCAGCGATG TCCCCGACCA GATCGCCGCG
ACCCGCTATC TCGTTGCCGA GGAACAGCGC CGGATGACCG GCCTCGTGAT CGACGCCTCG
ACCGCCGACG AGATCGAGGA CCCGACGGAT GAGACCCTGC AAACCTTCAT TGACGAGACC
CTCGGCGCCA ATGGCGAGCC GGTCTTCACT CGCCCGGAAT ACCGCGCCAT CACGCTCGTG
CGTTTCCAGC TCGATGACTT CATCCGCGAT GTCGCTGTTG ATGAAGCGAC CCTGCGGGAG
GTCTATGACT ATGAAATTGC GACCAACCAG ATCGGGACGC CCGCCTATCG CAGCTTCACC
CAGCTGACCG TCAGTGATCA GGCGGGTGCC GATGACGCAG CCAGCCGTCT CGCCGCGGGC
GAAGCCCCGT CTGCCGTTGC CGCCGAGCTG GGCCTCGACA CGCCGCTGGT CCAGACCGAT
GTCCAGCAAT TCGAGATCCC CGACGAGGGT CTTGGTGAAA CCGTATTTGC CATGTCGGCC
GGTGAGGCCC GCGCGGTTGA GGGACGTTTT GGCTGGAGCG CCGTGATTAT TACTGGCGCC
GAAGAGGCGA CCCAGCCTGC ATTTGAGGAA GAGCGTCCGC GGCTGCAGGC CGATGCGGCC
CGGGCCCAGG CGACGGACGA CATGTATGAT GCCATCTCGG CGTTCGAAAC CGTTCGCGCA
ACCGGTGCTT CGCTCGAGGT CGCTGCCGAG GAATCCGGTA CGCCGCTGGA AATCTTCCAG
CCTCTCGCCG TGAATTCGAT CGACGAGGAC CTGCAATTTG ATGGCGAACG CTACCAGGCA
CTCGCCCCTG AAATCCTGCC GGCGGCCTTC GATCACATCG AGGGATTCGC CACCAATCTG
GAAAGCTATA ACGAGACCGA TTTCTACACG CTTCGCGTCG ACGAGATCAT CCAGAGCCGT
CCGTTCGAAC TGGAAGAAAT CCGTGAACAG GCCGAGAGCC GCTGGCGCTC GATCCAGGTC
GACACCCAAT TGCAGGCTCG TGCCGAGGAT GCGCTGGCCC AGCTTGAAGC CGGTTCCGAC
ATGGAAATCG TCAGCCTGAT GGTTGGTGGG CGCACGGAAA CCTCGACCCT GAAACGCGGT
CAGACGGCGG GTGGCTTTGA TCGCAATGCG GTTGCCCTCG CCTTCACGAC GGACCCTGGC
GCCTACGAAA TGATTCAGGT GGGCGAAGGC CAATACCTGG TTCTCACCGT CAACGAGATC
ATCCCGGCCG ACATCGCCGC GGCCCCCGCC GCCGACCTTG CTGGGATCGA AACCAGCCTG
ACCACCGAGC TGGGCAATGA CATCGTTTCG GCGACCCGGG AATACCTGAT CCGCGATTAC
GGTATCACGG ACGAGTCGAT CGACAATCGC CTGTATTCGC TCGCCATTGG TGAGACCGAT
CCCAGCACAC GGCAATGA
 
Protein sequence
MLSNFRAFAQ SPIALIIIVL LVLSFAIAMP GAGGIFTGSG DAVVVVGPER ISQREMATAF 
NREVARLQEQ NPDVTREMAR EEGIAYQVLQ QQITMATMAA RARDLGLAIS DVAIVSEVAD
VPAFQNPITE RFDRDTMAAA LQRSGVTEDQ FARDIEGDLL RSQLMATLAG FSDVPDQIAA
TRYLVAEEQR RMTGLVIDAS TADEIEDPTD ETLQTFIDET LGANGEPVFT RPEYRAITLV
RFQLDDFIRD VAVDEATLRE VYDYEIATNQ IGTPAYRSFT QLTVSDQAGA DDAASRLAAG
EAPSAVAAEL GLDTPLVQTD VQQFEIPDEG LGETVFAMSA GEARAVEGRF GWSAVIITGA
EEATQPAFEE ERPRLQADAA RAQATDDMYD AISAFETVRA TGASLEVAAE ESGTPLEIFQ
PLAVNSIDED LQFDGERYQA LAPEILPAAF DHIEGFATNL ESYNETDFYT LRVDEIIQSR
PFELEEIREQ AESRWRSIQV DTQLQARAED ALAQLEAGSD MEIVSLMVGG RTETSTLKRG
QTAGGFDRNA VALAFTTDPG AYEMIQVGEG QYLVLTVNEI IPADIAAAPA ADLAGIETSL
TTELGNDIVS ATREYLIRDY GITDESIDNR LYSLAIGETD PSTRQ