Gene Noc_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2666 
Symbol 
ID3705163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3021589 
End bp3023118 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content52% 
IMG OID637739147 
Productthreonine dehydratase 
Protein accessionYP_344649 
Protein GI77166124 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01124] threonine ammonia-lyase, biosynthetic, long form
[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGTACA GCTATCTAGA ACGCATTCTC AAGGCCCGTG TTTATGAAAT TGCCAAGGAA 
ACTCCGCTGG AATCCATGGG GCGTCTTTCT GGGCGATTGC AAAACGCCGT ATTGCTAAAG
CGGGAAGATC TGCAACCCGT TTTTTCCTTT AAGTTGCGGG GAGCTCATAA CAAACTCCTC
CAGTTATCGG AGGAGGCACG TCAGCGAGGG GTGATTGCCG CTTCGGCGGG CAACCATGCT
CAAGGCGTTG CGCTTTCCGC CCGCAAGCTG GGGATAAGCG CCCGGATTGT GATGCCTCGG
ACAACACCGC CAATCAAAAT TGAAGCAGTG CGTGATCTGG GTGCCGAGAT CGATTTGGTG
GGCAATACTT ACGATGAAGC CTACCAATAT GCGCTGGCTT TGGCTGAAAA GCAGGTGTGT
ACTTTTATCC ATCCCTATGA CGATCCTGAA GTTATTGCTG GCCAAGGAAC CGTAGCGATG
GAGATTCTGC GCCAATACCC AGAGCCGTTG CATGCCATAT TTGTACCTGT AGGTGGCGGT
GGGCTTATTG CCGGGGTGGC TGCCTATGTT AAAGCTCTTT CGCCGGAAAT CCGTGTTATT
GGGGTGGAGC CCGACGATGC ATCTAGTCTT TATCAGGCTC TGCAAGTAGG AGAGCGGGTA
GTACTTGATC AGGTGGGTAT TTTTGCGGAT GGCGCCGCAG TTCGCCAGGT AGGCAAAGAA
CCCTTCCGCA TTGCGCGGGA AGCCGTGGAT GAAGTGCTGT TGGTGGATAG TGATGCCATT
TGCGCCGCCA TCATGGATAT TTTCGAGGAT ACCCGTTCTA TCGCTGAGCC GGCAGGGGCT
TTGGCTGTTG CCGGGTTAAA GCAGTATGTG GAGCGGGAAG GACTCCGGGG ACAGAGTTTG
GTAGCCATTG ACAGTGGGGC CAATATCAAT TTTGACCGGC TGCGCCATGT GGCCGAGCGG
GCTGAATTAG GTGAGCGGCG GGAGGCTTTG TTTTGTGTCA CAATCCCCGA GCAGCGGGGC
AGTTTTCTTG CTTTTTGCGA GGCCATTGGT AAACGGGGCA TTACCGAGTT TAATTACCGC
TATGGCGATT CCGGTGAGGC CCATGTGTTT GTGGGAATTC AGACGCGCAA CGGCAGTCAT
GTCAAGGATC AGTTACTCCA TGATCTTCAC CAGAAGGGCT ATTCCGTGGT GGATATGAGC
GATAACGAAA TGGCCAAACT CCACGTGCGT TATATGGTAG GCGGACAAGC ATCAAAGTTA
AAGGATGAGG TCCTTTACCG GTTTGAATTT CCCGAGCGTC CAGGTGCTCT ACTGCGTTTT
TTAACTCATA TGGGTAGCCG TTGGAATATT AGTCTGTTCC ACTACCGTAA CCATGGGGCA
GCTTATGGCC GGGTATTGGC AGGCATTCAA GTGCCTCCCG CCGAAAAAAT AGAATTTCAG
ATTTTTTTGG ATGAACTTGC CTATACCTGC CAGGAGGAGA CCGATAATCC CGCTTACCGG
TTATTTCTTG GTCCGCCATC CGCCTCGTGA
 
Protein sequence
MLYSYLERIL KARVYEIAKE TPLESMGRLS GRLQNAVLLK REDLQPVFSF KLRGAHNKLL 
QLSEEARQRG VIAASAGNHA QGVALSARKL GISARIVMPR TTPPIKIEAV RDLGAEIDLV
GNTYDEAYQY ALALAEKQVC TFIHPYDDPE VIAGQGTVAM EILRQYPEPL HAIFVPVGGG
GLIAGVAAYV KALSPEIRVI GVEPDDASSL YQALQVGERV VLDQVGIFAD GAAVRQVGKE
PFRIAREAVD EVLLVDSDAI CAAIMDIFED TRSIAEPAGA LAVAGLKQYV EREGLRGQSL
VAIDSGANIN FDRLRHVAER AELGERREAL FCVTIPEQRG SFLAFCEAIG KRGITEFNYR
YGDSGEAHVF VGIQTRNGSH VKDQLLHDLH QKGYSVVDMS DNEMAKLHVR YMVGGQASKL
KDEVLYRFEF PERPGALLRF LTHMGSRWNI SLFHYRNHGA AYGRVLAGIQ VPPAEKIEFQ
IFLDELAYTC QEETDNPAYR LFLGPPSAS