Gene Noc_2040 details

Gene Information

Locus tag: Noc_2040
Symbol:
ID: 3705191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2350337
End bp: 2351512
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 51%
IMG OID: 637738515
Product: tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase
Protein accession: YP_344030
Protein GI: 77165505
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0482] Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain
TIGRFAM ID: [TIGR00420] tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase
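
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2350337, 2351512)
print(length, protein_length_aa(length))  # 1176 391
```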


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.527359
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATAGCA CTAACGAACA CGGAGAAGCG AGAACGGAGT GCACCATCTC CCGCCGTTTT 
CCGCGCAATC AAATGAATAA GCCTAAGGTG GTTGTGGGTC TATCCGGAGG GGTTGACTCC
TCGGTTGCTG CCCTCCGGCT TCAGCAGCAA GGTTATCCCA CAAGTGGCCT TTTCATGAAG
AATTGGGTGG ATTTTGCCGA CGAATCCGAA TGCACTGTTG CGCAAGATAG GGAAGACGCC
CAAGCGGTGA CCTCCCTGCT AGGGATCTCC TTTCATGAAG CCAATTTCGC AATGGAATAC
TGGGATCGCG TATTCCACTA CTTTCTGGAA GAATACCGCC TGGGCCGTAC TCCCAATCCG
GATATTCTGT GTAACCGGGA AATTAAGTTC AAAACCTTCC TTGACCACGC TCTAGCGCTC
GGCAATACTC TTATCGCTAC GGGTCATTAT GCCCGCATTG ACCAGAAAGA AGGACGTTAT
CGTCTCCTTA AAGGACGGGA TAAAAATAAG GATCAAAGTT ATTTTTTGTA TACTTTAGGG
CAAAAACAAC TTTCTCATAC GCTGTTTCCG TTGGGAGAGT TGGAAAAGCC ACAAGTCCGC
CGAATTGCTA TACAAGCCCG GCTTCCTAAC CACAGCAAAA AAGATAGCAC CGGAATCTGT
TTTATCGGAG AACGCCGTTT CAAGGAATTT CTCAGTCGCT ATCTCCCCGC TCAGCCAGGT
GAAATGCGCA CCCCCGATAA GGAATGGATC GGCCAGCACG ACGGGCTTAT GTACTACACT
TTAGGACAAC GGCGGGGACT AGGCATTGGC GGCCGTAAAA ATAGCAACGG CGCGCCCTGG
TTCGTGGTGG GTAAAAACTT GCCACAGAAC ACCCTCTATG TGGCCCAGGG CAGGCATCAT
CCTTGGCTGA AAAGCTTAAT TCTGGAGGCT AATCAGCTCC ATTGGGTAGC TGGCCACCCC
CCACCCCTGC CCTATTCCTG TACCGCCAAA ATCCGCTACC GCCAGACCGA TCAGCACTGC
ACCCTTACAG CAATAGCCAA TGGCTATATC AAAGTCATTT TTAATGCACC ACAATATGCC
GCCACTCCAG GACAAGCCAT CGTCTTCTAT CAGGACGATG AGTGCCTTGG TGGAGGCACC
ATTACTACCA CGGACGCACT GGGGATACAA TCATGA
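
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the "GC content" field; the helper names are hypothetical, and how the database rounds its own value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first printed line is used here; paste the full gene sequence
# to compare against the GC content reported in the record.
first_line = "ATGCATAGCA CTAACGAACA CGGAGAAGCG AGAACGGAGT GCACCATCTC CCGCCGTTTT"
print(round(gc_percent(clean_sequence(first_line)), 1))
```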
 
Protein sequence
MHSTNEHGEA RTECTISRRF PRNQMNKPKV VVGLSGGVDS SVAALRLQQQ GYPTSGLFMK 
NWVDFADESE CTVAQDREDA QAVTSLLGIS FHEANFAMEY WDRVFHYFLE EYRLGRTPNP
DILCNREIKF KTFLDHALAL GNTLIATGHY ARIDQKEGRY RLLKGRDKNK DQSYFLYTLG
QKQLSHTLFP LGELEKPQVR RIAIQARLPN HSKKDSTGIC FIGERRFKEF LSRYLPAQPG
EMRTPDKEWI GQHDGLMYYT LGQRRGLGIG GRKNSNGAPW FVVGKNLPQN TLYVAQGRHH
PWLKSLILEA NQLHWVAGHP PPLPYSCTAK IRYRQTDQHC TLTAIANGYI KVIFNAPQYA
ATPGQAIVFY QDDECLGGGT ITTTDALGIQ S
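
The protein sequence corresponds to a direct translation of the gene sequence. The following is a minimal sketch of that translation in plain Python, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted); the names BASES, AMINO_ACIDS, CODON_TABLE, and translate are introduced here for illustration.

```python
# Codon table in NCBI order: each base position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCATAGCA CTAACGAACA CGGAGAAGCG"))  # MHSTNEHGEA
```

Applied to the last printed line of the gene sequence, which starts in frame because every preceding line holds 60 bases, the same function yields the final residues of the protein and stops at the TGA codon.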