Gene Information |
Locus tag | Mmwyl1_4034 |
Symbol | |
ID | 5368221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | - |
Start bp | 4557684 |
End bp | 4560899 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640806427 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_001342865 |
Protein GI | 152998030 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
| |
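The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. The function names are illustrative and not part of the database. It assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon.

```python
# Sketch: consistency check between the coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS ends with one stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length_bp = gene_length(4557684, 4560899)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the Gene Length (3216 bp) and Protein Length (1071 aa) fields.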
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00188727 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACTGACAT AAAAAGCGTC TTAATTTTAG GTGCAGGCCC TATTGTTATC GGTCAGGCTT GTGAGTTTGA CTATTCTGGA GCGCAAGCGT GTAAAGCGCT TCGTGAAGAA GGCATTCGCG TTATTCTGGT GAACTCAAAC CCAGCGACCA TCATGACAGA CCCAGTTATG GCGGATGCGA CTTACATCGA GCCAGTTGAA TGGAAAACCG TAGAAAAAAT CATTGAAAAA GAGCGCCCAG ATGCGGTTTT ACCGACCATG GGTGGTCAAA CGGCATTGAA CTGTGCGTTG GATCTTGAGC GTCACGGCGT GCTAGAAAAA TACAATGTTG AGATGATCGG TGCGACAGCT GATGCGATCG ACAAAGCGGA AGATCGTAGC CGTTTTGATA AGGCAATGCG TGCGATTGGT CTTGAGTGTC CGCGTGCGGG TATTGCTCAC AACATGGAAG AAGCGCTTAA AGTTCAAGCT GAAGTAGGCT TCCCTTGTAT TATTCGTCCT TCTTTCACTA TGGGTGGTAC CGGTGGTGGT ATCGCGTACA ACATGGAAGA GTTTGACGAA ATCTGTACTC GTGGTTTGGA CTTGTCTCCA ACCAATGAAT TGCTAATCGA TGAATCTTTG ATCGGTTGGA AAGAGTACGA GATGGAAGTT GTCCGTGATA AAAAAGACAA CTGCATCATT GTTTGTGCGA TTGAGAACTT CGACGCCATG GGGGTTCACA CAGGTGACTC TATTACAGTG GCGCCAGCAC AAACACTGAC TGATAAAGAA TATCAAATCA TGCGTAATGC TTCTTTGGCG GTATTGCGTG AGATCGGCGT AGAAACCGGT GGTTCCAACG TACAGTTCGG TATGGATCCA AAAACAGGTC GTCTTGTTGT TATCGAGATG AACCCTCGTG TATCTCGTTC ATCTGCTTTG GCATCGAAAG CGACTGGTTT TCCAATCGCA AAAATCGCGG CGAAATTGGC GATTGGCTAC ACGCTTGATG AGTTGCAAAA CGATATCACT GGCGGCCAAA CACCAGCGAG CTTCGAGCCA GCAATCGACT ACGTTGTGAC TAAGATTCCT CGTTTCACGT TCGAAAAATT CCCAACAGCG AATGATCGTC TAACTACGCA AATGAAGTCG GTTGGTGAAG TTATGGCGAT TGGCCGTACT TTCCAAGAGT CTTTGCAAAA AGCATTGCGC GGCTTGGAAG TTGGTTCTGA TGGCTTCAAT CCTCAGTTGG ATTTTGCTGA AGAAAACAGC AAAGAGAAAT TGGCTTACGA GCTTCAATCT CCTGGTTCTG ACCGTATTTG GTACATCGGT GATGCCTTCC GTTCTGGTAT GACTGTCGAT GAAGTGTACG AAGCAACAGG TGTTGATCAT TGGTTCTTGG TACAAATCGA AGACTTGATC AAAGAAGAAG CTGCATTGGC AGACAAAGGT CTGATCGATA TGACTTACGA CGTGATTCGT CGTTTGAAGC GTAAAGGTTT CTCTGATGCG CGTCTTGCTA GCTTGTTAAG CGTGACTGAA AAGTCTATGC GTGAGCGTCG TTATTTGATG AACGTTCATC CAGTTTACAA GCGTGTTGAT ACTTGTGCGG CAGAGTTTGC CACTAACACA GCGTACATGT ACTCAACGTA TGAAGATGAA TGTGAAGCTG CACCAACTGA TCGTGAAAAA ATCATCATCC TTGGTGGTGG TCCAAACCGT ATTGGCCAAG GTATCGAGTT CGACTACTGC TGTGTACACG CAGCTCTAGG TCTACGTGAA GACGGTTACG AAACCATTAT GGTGAACTGT 
AACCCTGAAA CGGTATCAAC TGATTACGAC ACTTCTGACC GTTTGTACTT CGAGCCAGTA ACGCTTGAGG ACGTGTTAGA AATCGTTCGC AAAGAAAAGC CAAAAGGCGT AATTGTTCAA TTCGGTGGTC AAACCCCGCT GAAAATCGCT CGTGCATTGC AAAACGAAGG CGTGCCAATC ATAGGTACAA CGCCTGAGTC TATCGACCGT GCAGAAGATC GTGAACGTTT CCAAAGCATG ATCCAGCGTT TAGGTTACAA ACAGCCTCAT AACGCGACAG TGCGTAGCGT TGATCAAGCG GCAGCGAAAG CTGCACTTAT CGGCTACCCA CTTGTGGTAC GTCCATCCTA TGTATTGGGT GGCCGTGCGA TGGAAATCGT TTATAACGAA AAAGAATTGA TGCGTTACAT GACCAGCGCG GTGAAAGTGT CTAACGATAG CCCTGTTCTG CTAGACCACT TCTTGAATGC AGCGATTGAA ATTGATATTG ACTGTATCAG TGATGGTCAT CAAGTGGTTA TTGGCGGCAT CATGCAACAT ATCGAACAAG CGGGTGTTCA CTCAGGTGAC TCAGCATGCT CTTTGCCACC ATATTCTTTG TCGAAAGAAG TGCAAGACGA CATTCGTGAG ATGATCAAAA ACATGGCGCT AGAACTTGGC GTTGTCGGTT TGATGAACAC TCAGCTTGCG ATTCAGGATG GCGAAATTTA TGTGATCGAG GTGAACCCTC GTGCATCACG TACTGTGCCT TTCGTTTCCA AGTGTATCGG TCGCTCTTTA GCACAAGTTG CCGCTTTGAT AATGGCAGGT AAAACACTGG AAGAGCTTGG TTTCACCAAA GAAATCATTC CTTCTTACTA CAGTGTGAAA GAAGCTGTTT TCCCATTCAA CAAGTTCCAA GGTGTCGATC CGATTCTAGG GCCTGAAATG AAGTCTACGG GCGAAGTGAT GGGCGTGGGC GATACTTTCG CTGAGGCTTT CGGTAAAGCG GTTCTTGGTG GTGGTACTGA ATTGCCAACC TCAGGTCGTG CTTTTATCAG TGTTCGCGAT ATGGACAAAG AAGGTGCAGT AGAAGTCGCT CGTCGCTTGG CTGAATTAGG ATTCGACCTT GTCGGAACCG AAGGTACAGC TAAATACCTA ACTGAGCGCG GCGTTGAAGT TCGTAAAGTG AATAAGGTAA ATGAAGGTCG CCCGCATATT GTTGATATGA TGAAAAATGG CGAAATTGAT TACATCATCA ACACCACGTC CGGTACGCAA GCGATTGCAG ATTCTTCTGT TATTCGTCGT ACAGCTTTAC AGCGCAAGGT TTGTTACACT ACGACATTGG CTGGTGCTGA AGCAACGAGT ATGGCGATTA GCCTAACGGG TGAAACGAAA GTTAGAAGAC TGCAAGATTT GCACTTGGGG AAATAA
|
Protein sequence | MPKRTDIKSV LILGAGPIVI GQACEFDYSG AQACKALREE GIRVILVNSN PATIMTDPVM ADATYIEPVE WKTVEKIIEK ERPDAVLPTM GGQTALNCAL DLERHGVLEK YNVEMIGATA DAIDKAEDRS RFDKAMRAIG LECPRAGIAH NMEEALKVQA EVGFPCIIRP SFTMGGTGGG IAYNMEEFDE ICTRGLDLSP TNELLIDESL IGWKEYEMEV VRDKKDNCII VCAIENFDAM GVHTGDSITV APAQTLTDKE YQIMRNASLA VLREIGVETG GSNVQFGMDP KTGRLVVIEM NPRVSRSSAL ASKATGFPIA KIAAKLAIGY TLDELQNDIT GGQTPASFEP AIDYVVTKIP RFTFEKFPTA NDRLTTQMKS VGEVMAIGRT FQESLQKALR GLEVGSDGFN PQLDFAEENS KEKLAYELQS PGSDRIWYIG DAFRSGMTVD EVYEATGVDH WFLVQIEDLI KEEAALADKG LIDMTYDVIR RLKRKGFSDA RLASLLSVTE KSMRERRYLM NVHPVYKRVD TCAAEFATNT AYMYSTYEDE CEAAPTDREK IIILGGGPNR IGQGIEFDYC CVHAALGLRE DGYETIMVNC NPETVSTDYD TSDRLYFEPV TLEDVLEIVR KEKPKGVIVQ FGGQTPLKIA RALQNEGVPI IGTTPESIDR AEDRERFQSM IQRLGYKQPH NATVRSVDQA AAKAALIGYP LVVRPSYVLG GRAMEIVYNE KELMRYMTSA VKVSNDSPVL LDHFLNAAIE IDIDCISDGH QVVIGGIMQH IEQAGVHSGD SACSLPPYSL SKEVQDDIRE MIKNMALELG VVGLMNTQLA IQDGEIYVIE VNPRASRTVP FVSKCIGRSL AQVAALIMAG KTLEELGFTK EIIPSYYSVK EAVFPFNKFQ GVDPILGPEM KSTGEVMGVG DTFAEAFGKA VLGGGTELPT SGRAFISVRD MDKEGAVEVA RRLAELGFDL VGTEGTAKYL TERGVEVRKV NKVNEGRPHI VDMMKNGEID YIINTTSGTQ AIADSSVIRR TALQRKVCYT TTLAGAEATS MAISLTGETK VRRLQDLHLG K
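The gene and protein sequences above are printed in blocks of 10 characters, so they need to be cleaned before use. The sketch below strips the spacing and translates a nucleotide string codon by codon. The helper names are hypothetical. It assumes the gene sequence is already given in the coding orientation (it starts with ATG even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so a standard codon table is used here.

```python
# Sketch: clean a block-formatted sequence and translate it.
# Helper names are illustrative; they are not part of the database.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout);
# the codon assignments are identical for tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str, stop_at_first_stop: bool = True) -> str:
    """Translate a coding sequence; unknown codons become 'X'."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*" and stop_at_first_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGCCAAAAC GTACTGACAT AAAAAGCGTC"))  # MPKRTDIKSV
```

Applying `translate` to the full gene sequence and comparing the result with `clean_sequence` of the protein sequence is a quick way to confirm that the two fields were copied without loss.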