Gene Noc_0185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0185 
Symbol 
ID3706218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp204089 
End bp205252 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content48% 
IMG OID637736702 
Producttetratricopeptide repeat protein 
Protein accessionYP_342248 
Protein GI77163723 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.340896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCGAAT GGTTATTGTT GCTGTTACCT GTGGCAGCGG CTTCAGGCTG GCTAGCAGGT 
AAGCGCAGTG CAGAAACCGT AAATGCGGAT AGTCACTCCC AACTAAATTC CGCCTACTTT
GCCGGCCTAA ATCATTTGTT AAATGAGCAG CCAGATAAGG CTATTGATAC CCTGCTTAAT
GCTTTGAAAG TGGATAGCGA CACGGTAGAA CCCTATCTAG CATTAGGTAA TCTATTTCGC
CGGCGCGGGG AGGTAGACCG GGCGATTCGG GTTCATCAAA ATCTTATTGA GAGGCCTTAT
TTAAGCAGCT CGCAAAGAGG ACAAGCCCTT TTGGAATTGG GTTTGGATTA TATGCGCGCG
GGAATGTTGG ATCGGGCTGA AAGCTCTTTT CTTGAGGTCC TTAAGCGAAG AAGCCACATA
GGTATTACCC TGCGCCAGTT ACTTGATCTC TATCAGCAGG AAAAGAATTG GCATCAAGCT
ATTGCTATGG CTCAAAAGCT GCACGAGGAG AGTGGCGAAG CGACGGAATC CATGATCGCT
CATTTTTATT GTGAACTTGC GGAACAGCAT TGGGCCCAGA AAAAAGCTGT GGAAACGACC
CGGTTTATCA AGCAGGCGCT GGCCTCGGAT TGGCGCTGTG TTCGAGCAAC TCTGCTCCAG
TCCAGTTTAG CAATGGAGAA AGGGGATTAT AAGAGGGCTA TTCGCTGTTT GCGGCAGGTT
GAGAGGCAAG ATCCAGACTA TTTGCCGGAG ATATTAAAGC CGCTCTCGGA ATGCTACCAG
TACCTGGAGG GCCAAGATAA ATTCTTTTTC TGGCTAACTG AAGCATCAAA GCGCCATCCA
GGATGTACTT CATTAGTTTT AGCCAGAGCG GCATACTTAC AGCAGCGGGG AGAACAGAAA
GAGGCTCGCT ATTTCCTAAT CGAGCAACTT AGAGTGTATC CTTCCGTTGA GGCACTTCAG
CAGTTGCTTG CTTTGGGAGT GCCAGAGGAT ATTGAGGCTG CTTCAGAGCC TTGGTCTTTA
ATAGAAGAAG TGGCTAGCCG CCTGTTAAAA GCTAAATTAA ATTACGTTTG CGGTTTTTGC
GGATTTGGCG GCAAGTATTG CTATTGGCAA TGCCCAGGTT GTAAACGCTG GGGGACCGTT
AAGCCCTTGG CTGTAGGTAC TTAA
 
Protein sequence
MIEWLLLLLP VAAASGWLAG KRSAETVNAD SHSQLNSAYF AGLNHLLNEQ PDKAIDTLLN 
ALKVDSDTVE PYLALGNLFR RRGEVDRAIR VHQNLIERPY LSSSQRGQAL LELGLDYMRA
GMLDRAESSF LEVLKRRSHI GITLRQLLDL YQQEKNWHQA IAMAQKLHEE SGEATESMIA
HFYCELAEQH WAQKKAVETT RFIKQALASD WRCVRATLLQ SSLAMEKGDY KRAIRCLRQV
ERQDPDYLPE ILKPLSECYQ YLEGQDKFFF WLTEASKRHP GCTSLVLARA AYLQQRGEQK
EARYFLIEQL RVYPSVEALQ QLLALGVPED IEAASEPWSL IEEVASRLLK AKLNYVCGFC
GFGGKYCYWQ CPGCKRWGTV KPLAVGT