Gene Noc_2606 details

Gene Information

Locus tag: Noc_2606
Symbol:
ID: 3704361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2960141
End bp: 2961181
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 52%
IMG OID: 637739087
Product: threonine aldolase
Protein accession: YP_344589
Protein GI: 77166064
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2008] Threonine aldolase
TIGRFAM ID:
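
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported span includes the stop codon; both assumptions are consistent with the numbers shown here but are not stated by the record itself. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a span that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the final codon is the stop codon
    return gene_len, protein_len


print(cds_lengths(2960141, 2961181))  # (1041, 346)
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.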


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000143153
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAGAAC GGCATTTTAT CAGCGACAAC GCAGCCGGTA TACACCCAGA AGTCATCGCC 
ATGCTGGAAA GGGCTAGCCG CGGCCACGCC ATTGCTTATG GCAACGATTC CTTAACGCAG
CAGGCTCTCC AGCTATTCAA ACAGCACTTC GGGGCACAGA CTGAGACATT TTTTGTACTG
ACTGGCACCG CCGCCAACGT CATTGCCCTG CAAAGCGTCC TATCTTCCTT CGAGGCTGTT
ATTTGTGCCG ACTGTGCCCA CCTCCACCGG GACGAATGCG GAGCGCCGGA AAAATTCCTG
GGATCAAAGC TGCTAATCGC CCAAACTCAG CAAGGCAAGC TAAGCGTAGC AACTGTAGCG
CCACTATTGC GCGACACGGC TATGGTCCAT CGCGTCCAGC CGAAAGTCCT TTCCATTACC
CAATGCACGG AATGGGGGAC TATCTATACT CCTGCAGAGA TCAAAACCCT GGCGGATTTC
TGCCATGAGC AAGGGTTGCT GCTACACATG GATGGGGCTC GGTTAAGTAA CGCCGCTGCC
CGACTCAATT TAAGCCTAAA AGAGATGACC GCAGATGTGG GCGTGGATGT ACTTTCCTTT
GGTGGCACCA AAAATGGGCT GCTAGCAGCT GAAGCGATTG TTTTCTTCGA TCCCCAGTTG
GCGAAAAAAA CCGGCTTCTA CCGTAAACAA AGCATGCAAC TAGCCTCCAA AATGCGTTTT
ATCTCAGCTC AATTTTTAGC TCTATTAATC AACGATCTCT GGTGGAAAAA TGCGCAGCAC
GCCAATGAAA TGGCGGCTTT ACTGGAACGA GAACTCAAGA ATATCCCGCA GGTGGAACTT
GTCGTCCCCG TAGAAACCAA CGGGATATTT GCCCGAATAC CTCCCTCTTG GGTACCCTGT
TTACAACAAC ATTATGCCTT TGCGGTCTGG GACTCGGCTA GCACGGTAGT GCGCTGGATG
ACCTCATTTG ATACCACGGC GGAGGAAGTG CAAGATTTTG CGCAAAAGAT CCGAAACATG
AACGAGGATA ACGCCCCCTA A
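
A GC percentage like the one reported for this gene could be recomputed from a sequence printed in 10-base groups, as above. The following is a minimal sketch, assuming the value is simply the share of G and C bases among all bases; the record does not say how its figure was derived or rounded. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()   # drop the 10-base grouping and newlines
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence block to compute the value for the whole gene.
first_line = "ATGGGAGAAC GGCATTTTAT CAGCGACAAC GCAGCCGGTA TACACCCAGA AGTCATCGCC"
print(round(gc_content(first_line), 1))
```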
 
Protein sequence
MGERHFISDN AAGIHPEVIA MLERASRGHA IAYGNDSLTQ QALQLFKQHF GAQTETFFVL 
TGTAANVIAL QSVLSSFEAV ICADCAHLHR DECGAPEKFL GSKLLIAQTQ QGKLSVATVA
PLLRDTAMVH RVQPKVLSIT QCTEWGTIYT PAEIKTLADF CHEQGLLLHM DGARLSNAAA
RLNLSLKEMT ADVGVDVLSF GGTKNGLLAA EAIVFFDPQL AKKTGFYRKQ SMQLASKMRF
ISAQFLALLI NDLWWKNAQH ANEMAALLER ELKNIPQVEL VVPVETNGIF ARIPPSWVPC
LQQHYAFAVW DSASTVVRWM TSFDTTAEEV QDFAQKIRNM NEDNAP
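
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so the sketch below builds the standard codon table. The names `CODON_TABLE` and `translate` are hypothetical, and the routine assumes the input is already in frame and on the coding strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()     # drop the 10-base grouping and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")   # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGGAGAAC GGCATTTTAT CAGCGACAAC GCAGCCGGTA TACACCCAGA AGTCATCGCC"
print(translate(first_line))  # MGERHFISDNAAGIHPEVIA
```

The printed peptide matches the first 20 residues of the protein sequence listed above.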