Gene Noc_2547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2547 
Symbol 
ID3704550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2895433 
End bp2896893 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content57% 
IMG OID637739026 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_344530 
Protein GI77166005 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATTT TTGAATCCTC ACGTCCCGGC CGTCAGGCTC GCGCTCAAGC GCCTAAGCCG 
ACAGCGGCCA CTAACGATTT ACCGGAACGA TTTCTGCGGC AACAGCCGCC GGCCTTGCCC
GAGGTCTCGG AAATGGATGT GGTGCGCCAC TACACGGGCC TGTCCCAGAA AAACTTTTCC
ATTGACACCC ACTTCTATCC CCTTGGCTCC TGCACCATGA AATATAACCC GCGGGCCTGC
CACACCCTGG CCAGCTTGCC GGGTTTTCTG GAGCGCCATC CCGCGACACC AGAAGCCATG
AGTCAAGGCT TTCTGGCTTG CCTTTATGAA CTGCAAGACA TTCTCGCCAA AATGACGGGG
ATGCAGACCA TGAGCCTAGC CCCCATGGCA GGCGCCCAAG GAGAGTTTAC TGGGGTCGCC
ATGATCCGCG CTTACCACGA GGCCAGAGGT GATAAAGAGC GGTGCGAGAT GCTGGTGCCG
GATGCGGCCC ATGGAACCAA TCCCGCCACG GCCACCATGT GCGGTTTCCG AGTCCGGGAA
ATTCCCACCA ACCCGGAAGG GGATGTGGAT CTAGAAGCAC TGCATAAAGC CCTGGGTCCC
CAAACGGCAG GCATTATGCT CACCAACCCC TCCACCCTTG GAATATTTGA TCGCAACATT
CAAGTCATTG CCCAAAGCGT CCATAAGGCG GGCGGGTTAC TTTATTACGA TGGGGCTAAT
TTAAACGCTA TTCTCGGCAA GGTGAAACCT GGCGACATGG GGTTTGATGT CATCCATCTC
AACCTTCATA AAACCTTTTC CACGCCCCAT GGCGGTGGTG GCCCTGGGTC TGGTCCAGTG
GGAGTGGGCG AAAAATTGCT GCCCTTCCTC CCGGTTCCCC GGGTGGCGCG TGCCGAAGGC
GGCAGCTACC GTTGGTTGAC GGCGGAGGAC TGTCCCCAAA CCATCGGCCC CCTCTCAGCC
TGGATGGGGA ACGCCGGGGT GCTGCTCCGG GCTTATATTT ATGTGCGCCT ACTCGGTCTT
GAAGGTATGA AACGGGTAGC GGACTTCTCC GCCCTTAACG CCAATTACTT GGCCCAACGC
ATGGCCGAAG CCGGTTTTGA TTTGGCCTAC CCCATGCGCC GCGCTGGCCA CGAATTCGTG
GTCACCCTGA AACGCCAGGC AAAAGAGCTG GGAGTCACCG CCACGGATTT CGCTAAACGA
CTGCTTGACT TGGGCTTTCA TGCGCCAACG ATTTACTTTC CGCTCCTAGT TCCCGAATGT
TTACTCATCG AACCGGCGGA AACGGAAAGC AAGCAAACCC TGGATGCTTT TGTGGCCGCG
ATGGAACAGA TCGCCAAAGA AGCCCAGGAA AATCCAGAAT TACTCAAACA GGCCCCCCAT
ACCCTACCGG CACGCCGCCT GGATGAGGTA AAGGCGGCCA AAGAACTGGA TCTAGCCTGG
AAACCAACTC CCCATGAGTA G
 
Protein sequence
MLIFESSRPG RQARAQAPKP TAATNDLPER FLRQQPPALP EVSEMDVVRH YTGLSQKNFS 
IDTHFYPLGS CTMKYNPRAC HTLASLPGFL ERHPATPEAM SQGFLACLYE LQDILAKMTG
MQTMSLAPMA GAQGEFTGVA MIRAYHEARG DKERCEMLVP DAAHGTNPAT ATMCGFRVRE
IPTNPEGDVD LEALHKALGP QTAGIMLTNP STLGIFDRNI QVIAQSVHKA GGLLYYDGAN
LNAILGKVKP GDMGFDVIHL NLHKTFSTPH GGGGPGSGPV GVGEKLLPFL PVPRVARAEG
GSYRWLTAED CPQTIGPLSA WMGNAGVLLR AYIYVRLLGL EGMKRVADFS ALNANYLAQR
MAEAGFDLAY PMRRAGHEFV VTLKRQAKEL GVTATDFAKR LLDLGFHAPT IYFPLLVPEC
LLIEPAETES KQTLDAFVAA MEQIAKEAQE NPELLKQAPH TLPARRLDEV KAAKELDLAW
KPTPHE