Gene Mmwyl1_4034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_4034 
Symbol 
ID5368221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp4557684 
End bp4560899 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content46% 
IMG OID640806427 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_001342865 
Protein GI152998030 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00188727 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAC GTACTGACAT AAAAAGCGTC TTAATTTTAG GTGCAGGCCC TATTGTTATC 
GGTCAGGCTT GTGAGTTTGA CTATTCTGGA GCGCAAGCGT GTAAAGCGCT TCGTGAAGAA
GGCATTCGCG TTATTCTGGT GAACTCAAAC CCAGCGACCA TCATGACAGA CCCAGTTATG
GCGGATGCGA CTTACATCGA GCCAGTTGAA TGGAAAACCG TAGAAAAAAT CATTGAAAAA
GAGCGCCCAG ATGCGGTTTT ACCGACCATG GGTGGTCAAA CGGCATTGAA CTGTGCGTTG
GATCTTGAGC GTCACGGCGT GCTAGAAAAA TACAATGTTG AGATGATCGG TGCGACAGCT
GATGCGATCG ACAAAGCGGA AGATCGTAGC CGTTTTGATA AGGCAATGCG TGCGATTGGT
CTTGAGTGTC CGCGTGCGGG TATTGCTCAC AACATGGAAG AAGCGCTTAA AGTTCAAGCT
GAAGTAGGCT TCCCTTGTAT TATTCGTCCT TCTTTCACTA TGGGTGGTAC CGGTGGTGGT
ATCGCGTACA ACATGGAAGA GTTTGACGAA ATCTGTACTC GTGGTTTGGA CTTGTCTCCA
ACCAATGAAT TGCTAATCGA TGAATCTTTG ATCGGTTGGA AAGAGTACGA GATGGAAGTT
GTCCGTGATA AAAAAGACAA CTGCATCATT GTTTGTGCGA TTGAGAACTT CGACGCCATG
GGGGTTCACA CAGGTGACTC TATTACAGTG GCGCCAGCAC AAACACTGAC TGATAAAGAA
TATCAAATCA TGCGTAATGC TTCTTTGGCG GTATTGCGTG AGATCGGCGT AGAAACCGGT
GGTTCCAACG TACAGTTCGG TATGGATCCA AAAACAGGTC GTCTTGTTGT TATCGAGATG
AACCCTCGTG TATCTCGTTC ATCTGCTTTG GCATCGAAAG CGACTGGTTT TCCAATCGCA
AAAATCGCGG CGAAATTGGC GATTGGCTAC ACGCTTGATG AGTTGCAAAA CGATATCACT
GGCGGCCAAA CACCAGCGAG CTTCGAGCCA GCAATCGACT ACGTTGTGAC TAAGATTCCT
CGTTTCACGT TCGAAAAATT CCCAACAGCG AATGATCGTC TAACTACGCA AATGAAGTCG
GTTGGTGAAG TTATGGCGAT TGGCCGTACT TTCCAAGAGT CTTTGCAAAA AGCATTGCGC
GGCTTGGAAG TTGGTTCTGA TGGCTTCAAT CCTCAGTTGG ATTTTGCTGA AGAAAACAGC
AAAGAGAAAT TGGCTTACGA GCTTCAATCT CCTGGTTCTG ACCGTATTTG GTACATCGGT
GATGCCTTCC GTTCTGGTAT GACTGTCGAT GAAGTGTACG AAGCAACAGG TGTTGATCAT
TGGTTCTTGG TACAAATCGA AGACTTGATC AAAGAAGAAG CTGCATTGGC AGACAAAGGT
CTGATCGATA TGACTTACGA CGTGATTCGT CGTTTGAAGC GTAAAGGTTT CTCTGATGCG
CGTCTTGCTA GCTTGTTAAG CGTGACTGAA AAGTCTATGC GTGAGCGTCG TTATTTGATG
AACGTTCATC CAGTTTACAA GCGTGTTGAT ACTTGTGCGG CAGAGTTTGC CACTAACACA
GCGTACATGT ACTCAACGTA TGAAGATGAA TGTGAAGCTG CACCAACTGA TCGTGAAAAA
ATCATCATCC TTGGTGGTGG TCCAAACCGT ATTGGCCAAG GTATCGAGTT CGACTACTGC
TGTGTACACG CAGCTCTAGG TCTACGTGAA GACGGTTACG AAACCATTAT GGTGAACTGT
AACCCTGAAA CGGTATCAAC TGATTACGAC ACTTCTGACC GTTTGTACTT CGAGCCAGTA
ACGCTTGAGG ACGTGTTAGA AATCGTTCGC AAAGAAAAGC CAAAAGGCGT AATTGTTCAA
TTCGGTGGTC AAACCCCGCT GAAAATCGCT CGTGCATTGC AAAACGAAGG CGTGCCAATC
ATAGGTACAA CGCCTGAGTC TATCGACCGT GCAGAAGATC GTGAACGTTT CCAAAGCATG
ATCCAGCGTT TAGGTTACAA ACAGCCTCAT AACGCGACAG TGCGTAGCGT TGATCAAGCG
GCAGCGAAAG CTGCACTTAT CGGCTACCCA CTTGTGGTAC GTCCATCCTA TGTATTGGGT
GGCCGTGCGA TGGAAATCGT TTATAACGAA AAAGAATTGA TGCGTTACAT GACCAGCGCG
GTGAAAGTGT CTAACGATAG CCCTGTTCTG CTAGACCACT TCTTGAATGC AGCGATTGAA
ATTGATATTG ACTGTATCAG TGATGGTCAT CAAGTGGTTA TTGGCGGCAT CATGCAACAT
ATCGAACAAG CGGGTGTTCA CTCAGGTGAC TCAGCATGCT CTTTGCCACC ATATTCTTTG
TCGAAAGAAG TGCAAGACGA CATTCGTGAG ATGATCAAAA ACATGGCGCT AGAACTTGGC
GTTGTCGGTT TGATGAACAC TCAGCTTGCG ATTCAGGATG GCGAAATTTA TGTGATCGAG
GTGAACCCTC GTGCATCACG TACTGTGCCT TTCGTTTCCA AGTGTATCGG TCGCTCTTTA
GCACAAGTTG CCGCTTTGAT AATGGCAGGT AAAACACTGG AAGAGCTTGG TTTCACCAAA
GAAATCATTC CTTCTTACTA CAGTGTGAAA GAAGCTGTTT TCCCATTCAA CAAGTTCCAA
GGTGTCGATC CGATTCTAGG GCCTGAAATG AAGTCTACGG GCGAAGTGAT GGGCGTGGGC
GATACTTTCG CTGAGGCTTT CGGTAAAGCG GTTCTTGGTG GTGGTACTGA ATTGCCAACC
TCAGGTCGTG CTTTTATCAG TGTTCGCGAT ATGGACAAAG AAGGTGCAGT AGAAGTCGCT
CGTCGCTTGG CTGAATTAGG ATTCGACCTT GTCGGAACCG AAGGTACAGC TAAATACCTA
ACTGAGCGCG GCGTTGAAGT TCGTAAAGTG AATAAGGTAA ATGAAGGTCG CCCGCATATT
GTTGATATGA TGAAAAATGG CGAAATTGAT TACATCATCA ACACCACGTC CGGTACGCAA
GCGATTGCAG ATTCTTCTGT TATTCGTCGT ACAGCTTTAC AGCGCAAGGT TTGTTACACT
ACGACATTGG CTGGTGCTGA AGCAACGAGT ATGGCGATTA GCCTAACGGG TGAAACGAAA
GTTAGAAGAC TGCAAGATTT GCACTTGGGG AAATAA
 
Protein sequence
MPKRTDIKSV LILGAGPIVI GQACEFDYSG AQACKALREE GIRVILVNSN PATIMTDPVM 
ADATYIEPVE WKTVEKIIEK ERPDAVLPTM GGQTALNCAL DLERHGVLEK YNVEMIGATA
DAIDKAEDRS RFDKAMRAIG LECPRAGIAH NMEEALKVQA EVGFPCIIRP SFTMGGTGGG
IAYNMEEFDE ICTRGLDLSP TNELLIDESL IGWKEYEMEV VRDKKDNCII VCAIENFDAM
GVHTGDSITV APAQTLTDKE YQIMRNASLA VLREIGVETG GSNVQFGMDP KTGRLVVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAIGY TLDELQNDIT GGQTPASFEP AIDYVVTKIP
RFTFEKFPTA NDRLTTQMKS VGEVMAIGRT FQESLQKALR GLEVGSDGFN PQLDFAEENS
KEKLAYELQS PGSDRIWYIG DAFRSGMTVD EVYEATGVDH WFLVQIEDLI KEEAALADKG
LIDMTYDVIR RLKRKGFSDA RLASLLSVTE KSMRERRYLM NVHPVYKRVD TCAAEFATNT
AYMYSTYEDE CEAAPTDREK IIILGGGPNR IGQGIEFDYC CVHAALGLRE DGYETIMVNC
NPETVSTDYD TSDRLYFEPV TLEDVLEIVR KEKPKGVIVQ FGGQTPLKIA RALQNEGVPI
IGTTPESIDR AEDRERFQSM IQRLGYKQPH NATVRSVDQA AAKAALIGYP LVVRPSYVLG
GRAMEIVYNE KELMRYMTSA VKVSNDSPVL LDHFLNAAIE IDIDCISDGH QVVIGGIMQH
IEQAGVHSGD SACSLPPYSL SKEVQDDIRE MIKNMALELG VVGLMNTQLA IQDGEIYVIE
VNPRASRTVP FVSKCIGRSL AQVAALIMAG KTLEELGFTK EIIPSYYSVK EAVFPFNKFQ
GVDPILGPEM KSTGEVMGVG DTFAEAFGKA VLGGGTELPT SGRAFISVRD MDKEGAVEVA
RRLAELGFDL VGTEGTAKYL TERGVEVRKV NKVNEGRPHI VDMMKNGEID YIINTTSGTQ
AIADSSVIRR TALQRKVCYT TTLAGAEATS MAISLTGETK VRRLQDLHLG K