Gene Mmar10_2011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmar10_2011 
Symbol 
ID4286704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMaricaulis maris MCS10 
KingdomBacteria 
Replicon accessionNC_008347 
Strand
Start bp2192584 
End bp2193549 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content66% 
IMG OID638141512 
Productproline iminopeptidase 
Protein accessionYP_757241 
Protein GI114570561 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0118869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCTT CCTACTCACG CCGCCTGCTC TATCCGCCGA TCCGGCCCTT GCAGGCCTCG 
CGATTGGCGG TCGGCAACGG ACATGACCTC TACATTGAGG AGTGCGGCCG GCCGGATGGC
CTGCCAGTCG TGACCCTTCA CGGCGGTCCC GGTGGCGGCG TATCGCCGGC GCTCAGACGG
TTTTTCGACC CCCGGCGCTA TCGTGTCATC CTGTTTGACC AGCGCGGTTG TGGTCGTTCG
ACACCGCATG GCGGGCTCGA GCACAACACC ACGCAGGACC TGATCGACGA CATCGAGCGC
ATCCGCGAGG TGATGGGGAT CGACAAATGG GTCGTCTTCG GCGGCTCCTG GGGAGCGACA
CTCGCTCTTG CCTATGCCCG TGCCCATCCG GACCGTTGCA TCGGCCTGAT CCTGCGCGGC
ATCTTCACCT GCTCCCAGCG CGAGCTGGAC TGGTTTTACA AGGACGGCGC CAACATGTTG
TTCCCGGATG CCTGGGAACG ACTTGTCGAC CCGCTCAGCC CGGAAGAGCG CGGCGACATC
ATCCGCGCCT ATTACGAACG CCTCGCCGAG CCGGACATCA TCCGCCGCCG GCCGGATGCG
CTGGCCTGGG CGCGATGGGA AAGCGCCCTG ATCTCGATGA CCGGCGACCC GTCGGCACCG
CTGGCCGATC CGGTCCGCTC GGACGCCCTC GCCCGGCTGG AAAGCCACTA CTTCTTCCAC
AAGGGTTTCT TCCAGCGAGA TGGAGAGCTG ATCGAGGATG CCGAGCGCTA CAATCACCTG
CCCGGCGTGA TCGTGCAGGG ACGCTATGAC GTCGTGACTC CGCCCCAAAC AGCATGGAGC
CTCGCCCGGG CTTGGCCGCG AGCGAGGCTC CACATGATTG GCGATGCCGG CCATGCGGCC
GGCGAGCCGG GCGTGGTCGA CGCGCTGGTG CGCGCGACCG ACGCCTTTGC CGACAAGTTC
GCCTAG
 
Protein sequence
MDASYSRRLL YPPIRPLQAS RLAVGNGHDL YIEECGRPDG LPVVTLHGGP GGGVSPALRR 
FFDPRRYRVI LFDQRGCGRS TPHGGLEHNT TQDLIDDIER IREVMGIDKW VVFGGSWGAT
LALAYARAHP DRCIGLILRG IFTCSQRELD WFYKDGANML FPDAWERLVD PLSPEERGDI
IRAYYERLAE PDIIRRRPDA LAWARWESAL ISMTGDPSAP LADPVRSDAL ARLESHYFFH
KGFFQRDGEL IEDAERYNHL PGVIVQGRYD VVTPPQTAWS LARAWPRARL HMIGDAGHAA
GEPGVVDALV RATDAFADKF A