Gene Noc_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2027 
Symbol 
ID3705178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2338734 
End bp2339987 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content50% 
IMG OID637738503 
Productglycine hydroxymethyltransferase 
Protein accessionYP_344018 
Protein GI77165493 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAGCA AAGAGATGCG TATTGCCAGT TATGATGAGG AACTCGAGAC CGCTCTTACC 
AATGAAGCAC GGCGGCAAGA GGAACATATT GAATTAATTG CTTCGGAGAA TTATGTCAGT
CCTCGGGTTT TAGAAGCCCA AGGGTCCGTG CTCACTAACA AGTACGCCGA AGGCTATCCT
GGCAAGCGTT ATTATGGGGG CTGTGAGTAC GTGGATGTGG CGGAGCGGTT AGCTATCGAA
CGGGCTAAAA TATTGTTCGA GGCTGATTAT GCTAATGTCC AACCCCACTC TGGCTCTCAG
GCGAATGCCG CTGCCTGTCT AGCTTTGCTA GCGCCGGGCG ATACCCTCAT GGGGTTGAGT
CTTGCCCATG GCGGGCATCT CACCCATGGC GCCAAGGTCA ATTTTTCAGG TCAAATTTTT
AACGCAGTTC AGTTTGGGGT AAATGCAGAT ACGGGACTTA TTGACTATGA TGAGGTGGAG
CAGCTAGCAA AGGCACATCG CCCCAAACTG ATTATCGCCG GATTTACCGC TTATTCCCGT
ATAGTTGATT GGCAGCGTTT CCGAGCGATC GCGGATGGAG TAGGCGCCTA TTTGCTAGCG
GATATCGCCC ATCTGGCCGG GATGATCGCC GCAGGAATTT ATCCTAATCC AGTGCAAATC
GCCGATGTCA CGACTAGCAC AACCCATAAA ACTTTACGGG GTCCCCGTTC AGGACTGATT
TTGGCTAAAG CCAACCCTGA GATTGAGAAA AAACTCAATT CCAAGGTCTT TCCCGGTATT
CAAGGGGGGC CTTTAATGCA TGTTGTCGCG GCCAAGGCGG TAGCCTTTAA AGAGGCTATG
GAGCCGGCGT TTAAGGATTA TCAACGGCAA GTGATTCGCA ATGCCCAGGC GATGGCAGAG
GCTATTCAGT CTCGAGGCTA TAAAATTGTT TCCGGTGGGA CCGATAGTCA TCTGTTTTTA
GTGGATCTCG TTGCCAAGGG TTTGACCGGC AAGGCTGCAG ATGCCGCGTT GGGTCGAGCA
AATATCACCG TAAATAAAAA TACGGTGCCT AATGATCCTC AATCTCCGTT TGTAACCAGT
GGTATTCGCA TTGGTAGCCC CGCCATGACT ACGCGTGGTT TTAAGGAAGC GGAGATTTGC
GAATTAGCGG GATGGGTTTG TGATGTGCTG GACGATATTG AAAATGAGAC TGTAATTGCG
GACACTAAGG AGAAAGTATT GGCTCTCTGC GCCCGCTTCC CGGTCTATGG TTAG
 
Protein sequence
MYSKEMRIAS YDEELETALT NEARRQEEHI ELIASENYVS PRVLEAQGSV LTNKYAEGYP 
GKRYYGGCEY VDVAERLAIE RAKILFEADY ANVQPHSGSQ ANAAACLALL APGDTLMGLS
LAHGGHLTHG AKVNFSGQIF NAVQFGVNAD TGLIDYDEVE QLAKAHRPKL IIAGFTAYSR
IVDWQRFRAI ADGVGAYLLA DIAHLAGMIA AGIYPNPVQI ADVTTSTTHK TLRGPRSGLI
LAKANPEIEK KLNSKVFPGI QGGPLMHVVA AKAVAFKEAM EPAFKDYQRQ VIRNAQAMAE
AIQSRGYKIV SGGTDSHLFL VDLVAKGLTG KAADAALGRA NITVNKNTVP NDPQSPFVTS
GIRIGSPAMT TRGFKEAEIC ELAGWVCDVL DDIENETVIA DTKEKVLALC ARFPVYG