Gene Noca_2449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2449 
Symbol 
ID4599790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2611176 
End bp2612456 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content71% 
IMG OID639777051 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase / GTP cyclohydrolase II 
Protein accessionYP_923640 
Protein GI119716675 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCG AGATCAGGCT CGACACCGTC GAGCGGGCGA TCGCCGACAT CGCCGCCGGC 
AAGGCCGTGG TCGTCGTCGA CGACGAGGAC CGCGAGAACG AGGGCGACAT CATCTTCGCC
GCCAGCAAGG CGACCCCCGA CCTGATGGCC TTCACGATCC GCTACAGCAG CGGCGTGATC
TGCGTGCCGA TGCCGGCCCG GATGCTCGAC CGGCTCGAGA TCCCGCTGAT GACGCCGCAC
AACAAGGACC GGCTGCGTAC GGCGTACACG ATCTCGGTCG ATGCCCGCGA CGGGGTGACC
ACGGGCATCT CCGCCGCCGA CCGGGCGCAC ACCGTCCGGG TGCTCGCCGA CTCGGCGACC
GAGCCGTGGG AGATCACCCG CCCCGGTCAC GTCTTCCCAC TGCGCTACCG CGAGGGCGGC
GTGCTGGTGC GCCGCGGACA CACCGAGGCC GCGGTCGACC TCGCGAAGCT GGCCGGGTTG
ACCCCCGCGG GCGTGCTGGT CGAGGTCGTC AACGACGACG GGACCATGAA GCGCGGGCCC
GAGCTGCGCG CCTTCGCCGA CGAGCACGGC CTGGCGATGA TCTCGATCGA CGACCTGGTG
CGCTACCGGC GGCGCCACGA GACCCTCGTC GAGCGGGTCG CCGAGACCCA GCTGCCGACC
CGGCACGGTG ACTTCACGGC GTACGGCTAC CGGATCACCG TCGACGGCTC CGAGCACATC
GCGCTCGTCC ACGGCGACAT CAGCGGACCG GAGCCCGTGC TCACCCGGGT GCACTCGGAG
TGCCTGACCG GCGACGTGTT CGGCAGCCAC CGCTGCGACT GCGGGCCACA ACTGGAGGAG
GCCCTCGAGC GGATCGTGGC CGAGGGGCGC GGCGTGGTCG TCTACCTGCG CGGCCACGAG
GGCCGCGGGA TCGGGCTGGT CGCGAAGCTG CAGGCCTACC AGCTCCAGGA CGGCGGCCGG
GACACCGTCG ACGCGAACCT CGACCTCGGC CTGCCGGCCG ACGCCCGCCA CTACGGCACG
GCCACCCAGG TGCTGCGCGA CCTCGGCGTC GGCAGCGTCC GGCTGATGAC CAACAACCCG
GACAAGGTGC GCAACCTCGA GGACTACGGT GTGTCGGTCG CCGCCCGGGT GCCGCTGACG
CCGCACCCCA ACGACCACAA CATCGCCTAC CTGCTCACCA AGCGCGACCG AATGGGTCAC
GATCTGCCCA ACCTTGCCGA TGGGGTGCCC GACACCCGTG CCGACGGGGT GCCCGACACC
CTTGCCCAGA ACGGAGCCTG A
 
Protein sequence
MSTEIRLDTV ERAIADIAAG KAVVVVDDED RENEGDIIFA ASKATPDLMA FTIRYSSGVI 
CVPMPARMLD RLEIPLMTPH NKDRLRTAYT ISVDARDGVT TGISAADRAH TVRVLADSAT
EPWEITRPGH VFPLRYREGG VLVRRGHTEA AVDLAKLAGL TPAGVLVEVV NDDGTMKRGP
ELRAFADEHG LAMISIDDLV RYRRRHETLV ERVAETQLPT RHGDFTAYGY RITVDGSEHI
ALVHGDISGP EPVLTRVHSE CLTGDVFGSH RCDCGPQLEE ALERIVAEGR GVVVYLRGHE
GRGIGLVAKL QAYQLQDGGR DTVDANLDLG LPADARHYGT ATQVLRDLGV GSVRLMTNNP
DKVRNLEDYG VSVAARVPLT PHPNDHNIAY LLTKRDRMGH DLPNLADGVP DTRADGVPDT
LAQNGA