Gene Mmar10_0474 details

Gene Information

Locus tag: Mmar10_0474
Symbol:
ID: 4284163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 553907
End bp: 555985
Gene Length: 2079 bp
Protein Length: 692 aa
Translation table: 11
GC content: 63%
IMG OID: 638139938
Product: endothelin-converting protein 1
Protein accession: YP_755705
Protein GI: 114569025
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
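
The coordinate and length fields above can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(553907, 555985)
print(length)                  # 2079
print(protein_length(length))  # 692
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.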


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00398267
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGC TCTTGCTGAG CACCGCCGTC CTGGCCCTCA CCGCTGCCTG TCAGCCGGCC 
ACGTCGACCG ATACGGCCGC CACCGAGACG CCCGCCGCCT CGACGCCCGC CGCCGCAACC
GCCAGTGCGG CCATCGGCGA TTGGGGTTTT GACATTTCCG GAATGGACAC CAGCTACGTC
GCCGGTGACG ACTTCTACCG CTACGCCAAT GGCACCTGGC TGGACACGAC GGAGATCCCG
TCCGACCGCT CGAACTACGG CATGTTCACC GAGCTGGCGA TCGAAGCCGA AGAACAGGTC
CAGGCGATCA TCCTCGAACT TGCCGCCATG GACGCGGCCG ACGGTTCGAT CGAACAGAAG
GTCGGCGACC TTTATGGCAG CTGGATGGAC ACCGCGACCA TCGACCAGCT CGGCCTCGCT
CCGGCGCAGC CTTATCTCGA CGAGATTGCT GCCGCCGAGA CCCATGCTGA TATCAATGCC
CTGTTTGCGA CGATCCATCA CCAGTCGCCC TATGGCGTCG GCATCATCCC CGACCCGGCA
GACACGACGC GCTACACCGT CTTCGTCGGC CAGGGCGGCC TCGGCCTGCC GGACCGCGAC
TACTATCTGG AAGAGGACAA TCAGGGCGCC CGTGACGCCT ATCTGGCCTT CATCGCCCAG
ATCTTTGACC TCGCCGGTAT CGAGGACGGC ACCGGCAAGG CGCAGGCGAT CTTCGATCTG
GAGATGCGCA TTGCCGAAAG CCACTGGACC CAGGCCGACA GCCGCAACAT CCAGATGATC
TACAACCCGA TGCCGCTCGA CCAGCTGAGC GCCCTGGCAC CGCAGCTGAG CTTCCAAGCC
GGCATGGAAC AGCTCGGGCT TGACGGTGTC GCGACCTATC TCGTCGCCCA GCCGAGCGCC
ATCGAGGCCG CCGGTACGAT CTTCGAAGAA ACACCTGTCG ACGTGTGGAA GGACTATATG
ACCTTCCATT ACATCCGCTC GAATGCCGGC GCACTGCCGG AAGCCTTCGA TGCCGCCAAC
TTCGCGATGT TCGGCACGAC GCTGAACGGA ATCGAAGAGC AGCGTCCCCG CGACCGCCGC
GGCGTCAATC TCGTCGGCGG CCAGCTGGGT GAAGCTGTCG GCCAGGTCTA TGTCGACCGT
CACTTCCCGC CGGAATCCAA AACCGCCATG GAAGCCCTGG TCGCCAATCT GATCGTCGCG
TTCGAGGGTC GCCTTGCAGC CCTCGAATGG ATGGATGAAG AAACCCGCGC CAATGCCCTG
CAGAAGCTGT CGACCTTCGA GCCGCGCATC GGCTATCCGG ACGAATGGCA GGACTATTCA
GCGCTTGAGG TCCGGGCCGA CGACCTGTTC GGCAATCTCG TTCGCCTGAC GGAGTTCCAG
TGGAACGAGC AGGTCGCCGA TCTGTCCGGT CCGGTCGACC GGAGCGCCTG GCCCTATCCG
CCGCAGACCG TCAACGCGTC CTACAACCCG CTGATGAACC AGATTACCTT CCCGGCCGGC
ATCCTGCAGG CGCCTTTCTT TGATCCAAAC GCCGATGCGG CGATCAATTA CGGCGCCATC
GGTGCGGTGA TCGGTCATGA AATCGGCCAC GGCTTCGACG ATCAGGGTCG CCGCTTCGAC
TATGATGGCT CGATCCGGGA CTGGTGGACT GTTGAGACGA ACGAGCGCTT CGAAGAGCGC
GCCGACATAC TGGAAGCTCA GTATGACGGC TATGAGCCGA TCGAGGGCAG CTTCGTGAAT
GGTGAATTCA CGATGGGTGA GAATATCGGC GACCTGGGTG GTCTGCAGAT GGCCTACACC
GCCTATCAGC GTCATCTCGA TGCGTGCTGC GACGGCGAAG CCCCGGTCAT TGACGGCTTC
ACCGGCGAGC AGCGCTTCTT CCTCGCCTGG GCTCAGGTCT GGCGCCGCCT GTATCGGGAA
GAAAACCTGC GCAACCGGCT CACGACCGAT CCGCACAGCC CGGCCCAATA CCGGACCAAT
GGCGTGGTCC GCAATCTGGA CGTCTGGTAC GAGGCCTTCG GCGTGACCGA GGACAATGAC
CTCTACCTGC CGCCGGAAGA GCGCGTCTCG ATCTGGTAG
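
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and how a GC fraction (as in the GC content field) could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Strip all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed (six blocks of 10 bases).
first_line = "ATGAAAAAGC TCTTGCTGAG CACCGCCGTC CTGGCCCTCA CCGCTGCCTG TCAGCCGGCC "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives the whole-gene value to compare with the record.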
 
Protein sequence
MKKLLLSTAV LALTAACQPA TSTDTAATET PAASTPAAAT ASAAIGDWGF DISGMDTSYV 
AGDDFYRYAN GTWLDTTEIP SDRSNYGMFT ELAIEAEEQV QAIILELAAM DAADGSIEQK
VGDLYGSWMD TATIDQLGLA PAQPYLDEIA AAETHADINA LFATIHHQSP YGVGIIPDPA
DTTRYTVFVG QGGLGLPDRD YYLEEDNQGA RDAYLAFIAQ IFDLAGIEDG TGKAQAIFDL
EMRIAESHWT QADSRNIQMI YNPMPLDQLS ALAPQLSFQA GMEQLGLDGV ATYLVAQPSA
IEAAGTIFEE TPVDVWKDYM TFHYIRSNAG ALPEAFDAAN FAMFGTTLNG IEEQRPRDRR
GVNLVGGQLG EAVGQVYVDR HFPPESKTAM EALVANLIVA FEGRLAALEW MDEETRANAL
QKLSTFEPRI GYPDEWQDYS ALEVRADDLF GNLVRLTEFQ WNEQVADLSG PVDRSAWPYP
PQTVNASYNP LMNQITFPAG ILQAPFFDPN ADAAINYGAI GAVIGHEIGH GFDDQGRRFD
YDGSIRDWWT VETNERFEER ADILEAQYDG YEPIEGSFVN GEFTMGENIG DLGGLQMAYT
AYQRHLDACC DGEAPVIDGF TGEQRFFLAW AQVWRRLYRE ENLRNRLTTD PHSPAQYRTN
GVVRNLDVWY EAFGVTEDND LYLPPEERVS IW
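
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is one way to reproduce it, assuming the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (the two differ in their permitted start codons). It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The `translate` function is illustrative, not the database's own tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing used in the display
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGAAAAAGC TCTTGCTGAG C"))  # MKKLLLS (first seven codons of the gene)
print(translate("ATCTGGTAG"))                # IW (last three codons, ending in a stop)
```

The two sample outputs match the first and last residues of the protein sequence shown above.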