Gene Noc_2003 details


Gene Information

Locus tag: Noc_2003
Symbol:
ID: 3705193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2310225
End bp: 2311742
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 55%
IMG OID: 637738480
Product: Mg chelatase-related protein
Protein accession: YP_343995
Protein GI: 77165470
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0606] Predicted ATPase with chaperone activity
TIGRFAM ID: [TIGR00368] Mg chelatase-related protein
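
The start, end, gene length, and protein length fields above can be checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2310225
END_BP = 2311742


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1518 505
```

Under these assumptions the computed values agree with the reported gene length of 1518 bp and protein length of 505 aa.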


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCTAG CGATTGCCTA CAGCCGGGCT CAGGCGGGCC TGGATGCCCC TTTGGTGACT 
GTGGAAGCCC ACCTCTCAAA TGGTCTCCCC GCTTTCTCCA TCGTGGGTCT GCCAGAAACC
GCAGTTAAGG AGAGCAGAGA ACGGGTGCGA GGCGCACTGC TCAACTGCCA TTTTGAGTTT
CCGGCGCGCC GTATTACGGT AAATCTGGCA CCTGCGGATC TGCCCAAGGA AGGGGGGCGC
TTTGACTTGG CGATTGCTTT AGGTATTTTG GCTGCCTCGG GGCAGATTTC CTCACCTGCG
TTAAGGGCTT ATGAATTTGC CAGCGAGCTT GCCTTAAGTG GGAAGGTGCG AAGTATCCGT
GGAGTGTTAC CTGCCGCATT GCAAACGGCA AAAGCGGGCC GTAGTCTAGT GGTCGCTGAA
GAAAATGCTC CCGAAGCGGC CCTCGTGTCC ACGGTTGAAG TATTAGCGGT TTCCCATCTA
TTAGAAATTT GCCAACATCT TCGGGGCGAG TCACGGCTAA CTCCCTTTAC TCCCAACCCT
CTTAAAGTGG TTGCCGATAA AAGAGGGGAT ATTGCAGATA TCCGGGGCCA GTACCATGCC
AAACGAGCGT TGGAAGTGGC AGCCGCAGGG GCTCATAATT TGTTAATGAT CGGACCACCA
GGAACTGGCA AAACCATGCT GGCCAGTCGC CTGCCGGGGA TTTTGCCTGA GATGGCCGAA
GCCGAAGCGT TGGAAAGCGC CACTGTACAG TCAATCAGCA GCCAAGGTTT TAATTCTAGC
CGCTGGCGCC AACGGCCTTT CCGAACTCCC CATCATACGG CTTCTGGAGT AGCCTTGGTA
GGCGGGGGCG GGCAGCCACG GCCAGGGGAG GTATCCTTGG CGCACCATGG AGTGCTCTTT
CTCGATGAGT TGCCAGAATT TGAGCGTCGG GTACTGGAGG TTCTTAGGGA ACCCCTGGAA
TCGGGCCGCA TTGTTATTTC CCGGGCAGCT CAGCAGGCCG AGTTTCCGGC TCGTGTACAG
CTGGTGGCCG CCATGAATCC TTGCCCCTGC GGTTATTTAG GCGATTCCAA AGGCCGTTGC
CGATGCACCA TAGAGCAAGT ACAACGTTAC CGGGCCCGGA TTTCCGGGCC TTTATTAGAT
CGCATCGATA TACAAATCGA GGTGCCACCC GTACCTTTGC ATCAGTTACG AACCGAAAGT
GAGAGCAGGA TGGAAACGAG TTGCCAGGTT CGAACTCGGG TGGAAGCAGC GCGGGAGCGT
CAGTTAGCCC GTTTTGGGCA ACCTAACAGC AGGTTAGGCA ATCGGGAAGT GGAACAGATT
TGCCGCCTTG GAGAGAAGAA TTATCAGTTA CTGGAGCGGG CCTTGGAGCA ATTAGGGCTT
TCGGCGCGCG CCTACCACCG TATATTGAAA GTTGCCCGGA CGATTGCCGA TTTGGAGGGA
AGTGAAACTA TTCGCACGCC CCATCTTTCC GAAGCGATTG GTTACCGGCG ATTAGACCGT
TCCCTTGCCA AATCTTAA
 
Protein sequence
MSLAIAYSRA QAGLDAPLVT VEAHLSNGLP AFSIVGLPET AVKESRERVR GALLNCHFEF 
PARRITVNLA PADLPKEGGR FDLAIALGIL AASGQISSPA LRAYEFASEL ALSGKVRSIR
GVLPAALQTA KAGRSLVVAE ENAPEAALVS TVEVLAVSHL LEICQHLRGE SRLTPFTPNP
LKVVADKRGD IADIRGQYHA KRALEVAAAG AHNLLMIGPP GTGKTMLASR LPGILPEMAE
AEALESATVQ SISSQGFNSS RWRQRPFRTP HHTASGVALV GGGGQPRPGE VSLAHHGVLF
LDELPEFERR VLEVLREPLE SGRIVISRAA QQAEFPARVQ LVAAMNPCPC GYLGDSKGRC
RCTIEQVQRY RARISGPLLD RIDIQIEVPP VPLHQLRTES ESRMETSCQV RTRVEAARER
QLARFGQPNS RLGNREVEQI CRLGEKNYQL LERALEQLGL SARAYHRILK VARTIADLEG
SETIRTPHLS EAIGYRRLDR SLAKS
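
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The sketch below is illustrative only: it builds the codon table from the standard NCBI amino acid ordering (translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons), and the helper names `clean_sequence` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO_ACIDS[i]
    for i, codon in enumerate(product(BASES, repeat=3))
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGTCGCTAG CGATTGCCTA CAGCCGGGCT CAGGCGGGCC TGGATGCCCC TTTGGTGACT"
)
last_line = clean_sequence("TCCCTTGCCA AATCTTAA")

print(translate(first_line))  # MSLAIAYSRAQAGLDAPLVT
print(translate(last_line))   # SLAKS*
```

The translated fragments match the first 20 and last 5 residues of the protein sequence, with the final TAA codon acting as the stop. The last line can be translated on its own because the 25 preceding lines hold 60 bases each, so it begins on a codon boundary.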