Gene Noc_2572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2572 
Symbol 
ID3704576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2923737 
End bp2926958 
Gene Length3222 bp 
Protein Length1073 aa 
Translation table11 
GC content51% 
IMG OID637739052 
Productcarbamoyl-phosphate synthase, large subunit, glutamine-dependent 
Protein accessionYP_344555 
Protein GI77166030 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0164695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAAAC GGAACGATAT TAATAGCATA GTTATTCTCG GTGCCGGACC TATTGTGATT 
GGTCAGGCCT GTGAGTTTGA CTATTCCGGT GCCCAAGCTT GCAAGTCTCT CAAAGAGGAA
GGCTATCGGG TGATTCTGGT GAATTCTAAT CCAGCAACCA TCATGACTGA TCCAGAACTA
GCCGATGCTA CTTACATTGA GCCGATTACC TGGAAAACGG TGGCCAAAAT CATTGAAAAG
GAGCGCCCCG ATGCATTATT ACCCACCATG GGCGGGCAAA CGGCTTTAAA CTGCGCACTG
GATTTGGCTC GCGAAGGAAT TCTGGAACGT TATGACGTGG AAATGATAGG AGCGAACCAG
GAAGCCATTA ATAAAGCCGA AGACCGGGAT CTCTTTCGCC AGGCGATGCA GAAAATCGGC
TTAGATATGC CCCGTTCGGC GATTGCCCAT AGCCTAGAAG AAGCGCAGCA GGCTCAGAGA
GATTTAGGAT TCCCGATTGT TATTCGGCCG TCGTTTACTT TGGGGGGCTC GGGGGGAGGG
ATTGCCTATA ACCGGGAAGA ATTCGTGGAA ATCTGTGAGC GAGGTCTAGA TCTCAGCCCC
AATTCAGAGC TGCTCATCGA TGAGTCTGTG CTTGGTTGGA AGGAATTTGA GATGGAGGTG
GTGCGGGATA GTAAGGATAA CTGCATCATT ATTTGCACTA TTGAAAATTT AGATCCGATG
GGCGTGCATA CGGGGGATTC CATTACCGTG GCGCCTGCTC AAACCCTTAC GGATAAGGAA
TACCAGGTGA TGCGCGATGC CGCCCTTGCA GTATTGCGCG AGATTGGGGT GGATACGGGT
GGCTCCAACG TGCAGTTCGC CGTTGATCCG GAGAATGGGC GGTTAATCAT TATCGAAATG
AACCCCCGGG TCTCCCGTTC TTCAGCCTTA GCTTCTAAAG CCACGGGTTT TCCCATTGCG
AAGGTGGCCG CAAAACTAGC CGTTGGCTAT ACGCTGGATG AGCTTCAGAA TGAAATTACT
GGTGGTCTGA CTCCGGCTTC CTTCGAGCCT AGTATTGATT ATGTGGTTAC GAAGATACCC
CGTTTTACCT TCGAAAAATT CCCCCGAGCG GATGCCCGCT TGACCACGCA GATGAAGTCA
GTAGGTGAAG TGATGGCTAT TGGCCGGAAC TTTCAAGAGT CCCTACAGAA GGCGCTCCGC
AGCTTAGAGA CGGGTACAGA TGGTTTCAAT GAAAAAGTGG ATCTGGCAGC GGAAGACGTT
GTGGAAACCT TACGGTACCA GCTACGGGTA CCCTCCGCGG ACAGGATCTA TTATCTTGCG
GATGCTTTCC GGGCTGGGTT TTCCATGGCG GAAATTTATG ACTTAAGTCA TATTGATCCC
TGGTTCTTGA CTCAGATTCA AGACTTAGTT GAAGTTGAGC AAGGATTGCG GGGTACTTCC
CTGGAGCAAG TGGAAAAAGA TCGACTCTAT CGTTTAAAGC GCCAGGGTTT TTCAGATCGT
AGGTTGGCCG TTCTCCTTGG AGCTGGGGAA GAGGAGGTAC GGCGGCGGCG GCATACGCTT
GGTATCCGCC CCGTTTATAA GCGAGTGGAT ACTTGCGGGG CTGAATTTGC TACCACCACA
GCTTATCTTT ACTCCACCTA CGATGAGGAG TGCGAGGCGG TCCCTAGTCA GCGTAATAAG
ATCATGGTAC TTGGAGGTGG ACCCAACCGC ATTGGTCAGG GGATTGAATT TGATTACTGT
TGCGTGCATG CCGCGTTGGC ATTACGGGAA GATGGTTATG AGACCATCAT GGTCAATTGC
AACCCGGAGA CGGTTTCCAC CGACTATGAT ACTTCCGACC GGCTCTACTT CGAGCCTTTG
ACTCTAGAAG ATGTTCTGGA GATTATTTCC TTGGAGCAGC CCCAAGGGGT TATCGTTCAA
TACGGCGGCC AAACGCCCCT CAAGCTAGCG CGTGCTTTGG AGGCGGCAGG GGCGCCGATT
ATCGGTACCT CCCCGGACTC TATCGATTTG GCCGAGGATC GGGAACGGTT TCAATGGCTC
ATTGAGAAAC TTGGGCTTAA GCAGCCTCCT AACCGTACTG CTCGCACCCA GGAAGAGGCT
ATCCGCTTAG CCGCTGAAAT TGGCTACCCT TTGGTGGTCA GGCCCTCCTA TGTACTTGGG
GGGCGCGCCA TGGAAATCGT TTACAATGCG GACGAACTCG CCAGGTACAT GAAAGAGGCG
GTGAGTGTTT CTAACGATTC TCCGGTGCTA CTGGACCGGT TTTTAGATGA AGCGACAGAG
GTGGATGTCG ATGCGATCTG TGATGGAGAG CAGGTCATGG TCGGTGGTAT CATGGAGCAT
ATTGAACAGG CCGGTGTCCA TTCTGGCGAC TCGGCATGCG CTTTACCGCC TTTTAGTTTG
AGCGCCGCGG TGCAGAACCG GCTGCGGGAG CAAGTGTATA ATATGGCTCG TGAGCTTAAG
GTGGTGGGTT TAATGAATAC CCAGTTCGCT ATCCAGGCAA ACGAGATCTA TATACTTGAG
GTTAATCCTC GCGCTTCCCG GACCGTGCCC TTTGTTTCCA AAACGATTGG TATTCCCCTT
GCCAAGGTTG CCGCCCGCTG CATGATGGGA CAGAGCTTAG CGAAACAAGG GATGATAGAG
GAAATAATAC CTCATTACTT TGCGGTAAAA GAAGCAGTTT TCCCCTTTAT CAAATTTTCC
GGCGTTGATC CTATCTTGGG GCCGGAAATG AAATCAACGG GGGAGGTGAT GGGGACTGGA
CGCTGTTTTG GAGAGGCTTT CTACAAGGCG CTTTTAGGGG CCGGAGTGGT ATTGCCCCAA
AAAGGCAAAG TTTTTATTAG TGTTCGAGAT GCCGATAAGC AGCGGATCGT TCCTGTTGCG
CAGGAACTCT CTAGGCTGGG TTTTGAGCTT TTAGCTACAC GGGGGACTAG CACGGTATTA
GATCAGGAGG GGGTTGCTTG TATCCAGATT AATAAGGTGC TCGAAGGTCA GCCCCATATC
GTGGACATGA TAAAAAATGA CGAAATTGCG CTTATCATTA ATACCACGGA AGGCCGTAAG
GCGGTTTCCG ATTCTTATAC CATTCGTCGT TCAGCTTTAC AGCATAAAGT GACCTATACT
ACCACCGTGG CTGGCGCGTG GGCCACGTGT GAAGCCCTGC GTGCAGAAAC CACCGATTCG
GTCTATCGAT TACAAGATTT ACAGCGGGAG GCGCGAAAAT GA
 
Protein sequence
MPKRNDINSI VILGAGPIVI GQACEFDYSG AQACKSLKEE GYRVILVNSN PATIMTDPEL 
ADATYIEPIT WKTVAKIIEK ERPDALLPTM GGQTALNCAL DLAREGILER YDVEMIGANQ
EAINKAEDRD LFRQAMQKIG LDMPRSAIAH SLEEAQQAQR DLGFPIVIRP SFTLGGSGGG
IAYNREEFVE ICERGLDLSP NSELLIDESV LGWKEFEMEV VRDSKDNCII ICTIENLDPM
GVHTGDSITV APAQTLTDKE YQVMRDAALA VLREIGVDTG GSNVQFAVDP ENGRLIIIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELQNEIT GGLTPASFEP SIDYVVTKIP
RFTFEKFPRA DARLTTQMKS VGEVMAIGRN FQESLQKALR SLETGTDGFN EKVDLAAEDV
VETLRYQLRV PSADRIYYLA DAFRAGFSMA EIYDLSHIDP WFLTQIQDLV EVEQGLRGTS
LEQVEKDRLY RLKRQGFSDR RLAVLLGAGE EEVRRRRHTL GIRPVYKRVD TCGAEFATTT
AYLYSTYDEE CEAVPSQRNK IMVLGGGPNR IGQGIEFDYC CVHAALALRE DGYETIMVNC
NPETVSTDYD TSDRLYFEPL TLEDVLEIIS LEQPQGVIVQ YGGQTPLKLA RALEAAGAPI
IGTSPDSIDL AEDRERFQWL IEKLGLKQPP NRTARTQEEA IRLAAEIGYP LVVRPSYVLG
GRAMEIVYNA DELARYMKEA VSVSNDSPVL LDRFLDEATE VDVDAICDGE QVMVGGIMEH
IEQAGVHSGD SACALPPFSL SAAVQNRLRE QVYNMARELK VVGLMNTQFA IQANEIYILE
VNPRASRTVP FVSKTIGIPL AKVAARCMMG QSLAKQGMIE EIIPHYFAVK EAVFPFIKFS
GVDPILGPEM KSTGEVMGTG RCFGEAFYKA LLGAGVVLPQ KGKVFISVRD ADKQRIVPVA
QELSRLGFEL LATRGTSTVL DQEGVACIQI NKVLEGQPHI VDMIKNDEIA LIINTTEGRK
AVSDSYTIRR SALQHKVTYT TTVAGAWATC EALRAETTDS VYRLQDLQRE ARK