Gene Noca_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2041 
Symbol 
ID4595795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2184532 
End bp2185695 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content72% 
IMG OID639776644 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_923237 
Protein GI119716272 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATGC TCGTTCTCGG ACTGACCGCG ATGCTCGGCC TGACGCTCAC GACCGCCACG 
CCGGCCGACG CCCATGCCCC CGCCACCGAC ACCGCCGCCG CGACGACGCA GGTTCCGGCA
CTGAGGGTCA CCCGGCTGGT CACCGGCCTC GACCACCCCT GGGACGTCCG GCCGATCGGC
GACGGCCGGC TGATCTTCAC CCAGCGCGAC CGGGCCACCG TCTCGATCTG GGACGGCAGC
CGGACCCGGC TGGTGCAGGG CTTCCCGAGC GACTCGGTGT GGGTCTCCGG CGAGACCGGG
CTGATGGGGC TCGAGGTCGA CCCGTCGTTC GCGAGCAACC GCACGTTCTA CACCTGCCAG
GGCGGCTTCA CCGCCGGTGG CGGGCACGAC GTGCGCATCA TCCGCTGGAC GCTGCGCGAC
GACCTGGTCT CGATCTCGGG GAGCAAGCGG CTGCTCGGCG GACTGCCCGC CACCAGCGGA
CGGCACGGGG GTTGCCGGCT GCTGGCGGTC GGCAAGCGGC TGTACGCCGG CACCGGGGAC
GCCGCGACCG GCAGCACGCC GGAGAACAAG AAGTCGCTGG GCGGCAAGAC GCTGTGCCTG
CTCGCCGCGA CCGGCAAGCC CTGCGGGAGC AACCCGTTCG CCGGGTCGAA GAACCACAAC
AAGCGCTACG TGCACACCTA CGGCCACCGC AACGTCCAGG GCCTCGACCG GCGCCGCGAC
GGCACCCTGT GGTCGGTCGA GCAGGGCAGC TACCGCGACG ACGAGGTCAA CCGGCTGCGC
AAGGGCGGCG ACTACGGCTG GAACCCGGTC CCGGGCTACG ACGAGTCGGT GCCGATGACC
GACCAGTCGC TGCCGGGCCG CCAGCGCGCC GCCGTATGGC GCTCGGGCGA CCCGACGCTG
GCCACCTCCG GCGGCGGCTT CGTCTACGGC AAGCGATGGG GCGCCCTGGA CGGCAGCTTC
GCGGTCGCCG CGCTCAAGGC GGAGAGGGTG CTGTTCCTCC AGCTCTCCGC ATCCGGCAGG
CTGCAGAGGG TGCGGGTGCC GGCGGCGCTG CGTCAGCACG GCCGGATCCG CACGGTGGTC
GACGGCCCCG GCTCGGTCGC CTACGTCACC ACCGACAACG GGAACGGGAA CGACGCGATC
CTCGTTGTCA GACCCACACG ATGA
 
Protein sequence
MRMLVLGLTA MLGLTLTTAT PADAHAPATD TAAATTQVPA LRVTRLVTGL DHPWDVRPIG 
DGRLIFTQRD RATVSIWDGS RTRLVQGFPS DSVWVSGETG LMGLEVDPSF ASNRTFYTCQ
GGFTAGGGHD VRIIRWTLRD DLVSISGSKR LLGGLPATSG RHGGCRLLAV GKRLYAGTGD
AATGSTPENK KSLGGKTLCL LAATGKPCGS NPFAGSKNHN KRYVHTYGHR NVQGLDRRRD
GTLWSVEQGS YRDDEVNRLR KGGDYGWNPV PGYDESVPMT DQSLPGRQRA AVWRSGDPTL
ATSGGGFVYG KRWGALDGSF AVAALKAERV LFLQLSASGR LQRVRVPAAL RQHGRIRTVV
DGPGSVAYVT TDNGNGNDAI LVVRPTR