Gene Noc_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1020 
Symbol 
ID3707281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1130615 
End bp1131799 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content56% 
IMG OID637737525 
Producttryptophan synthase subunit beta 
Protein accessionYP_343058 
Protein GI77164533 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGATG AAAAGGGTCA TTTTGGTCCT TATGGTGGCC GCTTTGTAGC GGAAACCCTG 
ATGGAGCCTA TTGAGGAACT CCGCCAGGCC TATGCACGCT ACCGTGATGA TCCAGATTTC
CAGGCCGAGC TAGAAGCCGA CCTTGCCCAT TATGTGGGGC GGCCGACGCC CTTGTATTTT
GCCAAACGCT GGAGCGAGCA GTTAGGGGGT GCCCGGATCT TTTTGAAGCG GGAAGATCTA
AGTCATACCG GCGCCCATAA AATCAACAAT ACCGTGGGGC AAGCCCTGCT GGCAAAGCGG
ATGGGGAAGA CTCGTTTGAT CGCGGAAACG GGAGCGGGTC AGCATGGCGT AGCTACCGCC
ACGGTAGCCG CCCGTTTAGG GTTGGAATGC GTGGTCTACA TGGGGGCTGA GGATATAGAA
CGACAAGCCC CTAATGTTTA CCGGATGCGC CTGTTAGGGG CAGAAGTAGT GCCGGTGACT
TCCGGGTCTA GAACCTTGAA GGATGCCTTG AATGAAGCCA TGCGGGATTG GGTGACTCAT
GTGGATAACA CCTTTTATGT GATTGGCACC GTGGCGGGGC CTCATCCCTA TCCCGTGATG
GTGCGGGATT TCCAGGCGGT GATTGGGCGG GAGGCCCGGA CCCAGATTCT AGAGCAGCTA
GGCGCGCTTC CCCAGGCTTT GGTGGCTTGC GTGGGGGGAG GGTCCAACGC CATCGGCTTA
TTTCATTCTT TTTGCAAGGA TGAGCAGGTC GCACTCTATG GTGTGGAAGC GGCGGGGCTA
GGATTGGAGA GCGGACAACA TGCCGCTTCC CTCTGTGCGG GCAAACCGGG TGTTCTTCAT
GGTAACCGGA CCTATTTGGT GGAAAATAGC CATGGTCAAA TTATAGACAC CCATTCCATT
TCAGCGGGTC TTGATTATCC TGGGGTAGGC CCCGAGCATG CTTGGCTGAA GGATACTGGG
CGGGCGACCT ATGTGGCGAT TAGCGATGAG GAGGCCCTGG GGGCTTTTCA TGCCTTGACC
CGTATTGAAG GCATTATTCC CGCTTTGGAA AGCAGTCATG CCTTGGCCTA CGCTACGCAA
TTGGCCCCCA CTTTAGCTCC AGAGCAGGGG ATTATCGTGA ATCTTTCAGG GCGCGGGGAT
AAAGATATCA ATACGGTAGC CAAAATTGAG GGGATTTCCC TGTGA
 
Protein sequence
MPDEKGHFGP YGGRFVAETL MEPIEELRQA YARYRDDPDF QAELEADLAH YVGRPTPLYF 
AKRWSEQLGG ARIFLKREDL SHTGAHKINN TVGQALLAKR MGKTRLIAET GAGQHGVATA
TVAARLGLEC VVYMGAEDIE RQAPNVYRMR LLGAEVVPVT SGSRTLKDAL NEAMRDWVTH
VDNTFYVIGT VAGPHPYPVM VRDFQAVIGR EARTQILEQL GALPQALVAC VGGGSNAIGL
FHSFCKDEQV ALYGVEAAGL GLESGQHAAS LCAGKPGVLH GNRTYLVENS HGQIIDTHSI
SAGLDYPGVG PEHAWLKDTG RATYVAISDE EALGAFHALT RIEGIIPALE SSHALAYATQ
LAPTLAPEQG IIVNLSGRGD KDINTVAKIE GISL