Gene Noc_0471 details


Gene Information

Locus tag: Noc_0471
Symbol:
ID: 3706642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 505981
End bp: 507786
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 56%
IMG OID: 637736980
Product: thiamine pyrophosphate protein
Protein accession: YP_342524
Protein GI: 77163999
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
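
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence is one uninterrupted reading frame ending in a single stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 505981
end_bp = 507786

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: one uninterrupted reading frame whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1806 0 601
```

Under these assumptions the result agrees with the reported gene length of 1806 bp and protein length of 601 aa.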


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA TAGTCAGCGA TTTTCTCTTA CACCGATTGA ACGAATGGGG CATCAACCGG 
ATTTACGGCT ATCCCGGGGA TGGCATCAAT GGAATCGTCG GCGCCCTGGA CCGGCTTCAA
GACCGGATAG AGTTTATTCA AACCCGGCAT GAGGAAATGG CGGCTTTTAT GGCCTGCGCC
CATGCTAAAT TTACCGGCGA AGTAGGCGTC TGCCTGGCCA CTTCAGGACC CGGGGCCATC
CACCTGCTGA ATGGCCTTTA TGATGCCAAA CTGGACCATC AGCCGGTGGT GGCCATTGTG
GGCCAACAAT CCCGCGCCGC CCTCGGGGGA GATTATCAAC AAGAAGTGGA TCTCATTTCC
TTGTTCAAGG ATGTCGCCCA TGAATACGTA CATATGTGCG CTACTCCCGC CCAGGTGCGC
CATTTAATTG ATCGCGCGGT CCGCATTGCC AAAACAGAGC GCACCGTGAC CTGCCTTATC
TTTCCCAATG ACGTGCAGGA ATTGGAAGCC GTTGAGAAAC CGCCACGGGC TCACGGCACC
ATCCATTCCA GCACCGGGTA TACGATCCCC CGGGTGATTC CTCATCAGCA AGATCTCCAA
CAAGCTGCCG AGGTGCTCAA TAGAGGCAAA AAGGTCGCTA TCCTGGTGGG AGCTGGCGCT
TTGGGGGCCA CGGATGAAGT TATTCAGGTC GCTGAACTGC TCGGCGCAGG GGTAGCGAAA
GCCTTGCTGG GCAAGGGCGC TCTGCCTGAT GAACTTCCCT TCGTGACCGG CGCTATCGGC
CTGCTGGGGA CTAAACCGAG CTGGGAATTA ATGGACGGCT GCGATACGCT GTTGATGATT
GGTTCAAGTT TCCCCTATTC CGAATTCCTG CCGGAGGAAG GCCAAGCCCG GGGCGTGCAG
ATTGACCTGG ACGGGCGCAT GCTGGGAATC CGCTATCCCA TGGAAGTGAA TCTGGTGGGA
GACAGTGCGG AAACCCTGCG GGCCTTAATC CCTCTTCTCA CACGAAAAAC GAACCGGGCC
TGGCGAGAGA AGATCGAAAA AGACGTGGCC CAATGGTGGC AGGTACTCGA AAGCCGCGCC
ATGCACGATG CGGACCCTAT TAACCCCCAG CGGGTTTTCT GGGAGCTTTC TTCCCGACTG
CCGGATAACT GCATCATCAG CAGCGACTCC GGTTCCGCCG CCAACTGGTA TGCCCGGGAT
CTTAAAATCC GCCGAGGTAT GATGTGCTCT CTCTCGGGGG GCTTGGCGAC CATGGGCCCC
GGCGTTCCCT ATGCCATTGC GGCCAAATTC GCCTTTCCGG ATCGGGTGGC TATTGCCCTT
GTAGGGGATG GAGCCATGCA GATGAACGGC AACAGCGAAC TGGTCACCGC AGCTAAATAT
TGGCAACAAT GGCAAGATCC CCGGCTGATT GTCTTGGTAC TCAATAATCG GGATCTCAAT
CAAGTCACCT GGGAGCAGCG GGTGATGTCG GGCGATCCCA AGTTCGAAGG CTCCCAAAGC
TTGCCCGACT TTCCCTATGC CCGTTATGCC GAACTACTTG GCTTTAAAGG CATCCGCGTT
GATCGGCCGG AAAGTATCGG CCCCGCTTGG GAGGAAGCCC TAGCCGCTGA CCGACCCGTA
ATACTAGAAG CGTATACCGA TGGGAACGTG CCGCCCTTGC CTCCCCATAT CAAGCTGGAA
CAGGCCAAAG CCTATGTCTC CGCCTTGCTG CACCGAGATC CGGAAGCCAT CAACATTATT
AAGCAGTCCA TCAAGGAAAT TAAAGAAAGC TGGTTTTCCA GTGGTCAAGA AGAAAAGGGC
AATTAG
 
Protein sequence
MSQIVSDFLL HRLNEWGINR IYGYPGDGIN GIVGALDRLQ DRIEFIQTRH EEMAAFMACA 
HAKFTGEVGV CLATSGPGAI HLLNGLYDAK LDHQPVVAIV GQQSRAALGG DYQQEVDLIS
LFKDVAHEYV HMCATPAQVR HLIDRAVRIA KTERTVTCLI FPNDVQELEA VEKPPRAHGT
IHSSTGYTIP RVIPHQQDLQ QAAEVLNRGK KVAILVGAGA LGATDEVIQV AELLGAGVAK
ALLGKGALPD ELPFVTGAIG LLGTKPSWEL MDGCDTLLMI GSSFPYSEFL PEEGQARGVQ
IDLDGRMLGI RYPMEVNLVG DSAETLRALI PLLTRKTNRA WREKIEKDVA QWWQVLESRA
MHDADPINPQ RVFWELSSRL PDNCIISSDS GSAANWYARD LKIRRGMMCS LSGGLATMGP
GVPYAIAAKF AFPDRVAIAL VGDGAMQMNG NSELVTAAKY WQQWQDPRLI VLVLNNRDLN
QVTWEQRVMS GDPKFEGSQS LPDFPYARYA ELLGFKGIRV DRPESIGPAW EEALAADRPV
ILEAYTDGNV PPLPPHIKLE QAKAYVSALL HRDPEAINII KQSIKEIKES WFSSGQEEKG
N
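
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table is the standard set of codon assignments, which translation table 11 shares for elongation (it differs in which start codons are permitted; this gene begins with ATG). Only the first row of the gene sequence is used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence shown above.
first_row = clean(
    "ATGAGCCAGA TAGTCAGCGA TTTTCTCTTA CACCGATTGA ACGAATGGGG CATCAACCGG"
)

print(translate(first_row))  # MSQIVSDFLLHRLNEWGINR
print(f"{gc_content(first_row):.0%}")
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Running `gc_content` on the full cleaned gene sequence gives a value to compare with the GC content reported in the record.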