Gene Noca_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2120 
Symbol 
ID4599964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2267364 
End bp2268464 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content67% 
IMG OID639776723 
Productalcohol dehydrogenase 
Protein accessionYP_923316 
Protein GI119716351 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTCA CCGCTGCCGT TTCCCGTGAG AAGGGCGCGC CGCTCGTCGT CGAGGAGCTG 
GAGCTGGATG CGCCACGCTC CACCGAGGTG CGCGTGCGAA TGGTGGGTTC CGGGATCTGC
CACACCGACG CTGTTGCCCG GGACCGCATC TACCCGGTGC CCGAGCCGTC GGTCTTCGGA
CACGAGGGGT CTGGCGTCGT CGAGGAGGTC GGCTCCGATG TGCGTGGCGT GCAGGTGGGG
GACCACGTCG TGCTGGGCCC GTCGTACTGC GGCAAGTGCA CCTTCTGCCG AAGCGGTGAG
CCGATGTACT GCGAGAACGG CTTCCCCGAG CTGTTCGGTT GTCGTCGGCA CGATGGGACC
ACGGCCTTCA GCAAGGATGG CGAGATGGTC GGCTCCCACT TTTTCGGGCA GTCGTCGTTC
GCGACCCACG CCAACGTCAC CGAGAACAGC GTCATCGTCG TCGACAAGGA CGCCCCGCTG
GAGCTGCTCG GCCCACTGGG ATGCGGGCTC AACACCGGTG CGGGGGCCGT GCTCAACGAG
ATGCGGCCGG CGGCCGGGTC CTCGATTGTC GTCTTCGGTA CCGGAGCGGT CGGCTTCGCC
GCGCTCATGG CGGCAGCCGC GGTGTCGTGC TCCACGATCA TCGGCGTCGA CATCCACGAC
TCCCGTCTGG AGCTGGCCCG GGAGCTGGGC GCGACGCACA CCATCAACTC CTCGTCCCAG
GACCTGCATG CCGAGCTGGA GAAGATCACC GGCGGGCGGG GCGTGAACTA CGCACTGGAC
ACCACCGCGA GGTCAAGCGT GGTTCGGGAC GCTGCCGATG CGCTCGGCAA GCGGGGTGTG
CTCATCGCGG TCGGCGCGGC CGCGCCCGGC GATGAGGTCA GCTTCGAGGT CGGCAACTCT
CTGGTCAAGG GCTGGACCTT CAAGACCGTG ATCGAGGGGT CGGCAGTGCC GCAGGTGTTC
ATCCCGCGCC TGGTCGACCT GTGGAAGCAG GGCAAGTTCC CCTTCGACAA GCTGGTGAAG
ACCTACTCCC TGCATGACAT CAACACCGGC TTCGAGGACT CCGCCTCCGG GGCCGTCATC
AAGCCCGTGG TTGCCTACTG A
 
Protein sequence
MTVTAAVSRE KGAPLVVEEL ELDAPRSTEV RVRMVGSGIC HTDAVARDRI YPVPEPSVFG 
HEGSGVVEEV GSDVRGVQVG DHVVLGPSYC GKCTFCRSGE PMYCENGFPE LFGCRRHDGT
TAFSKDGEMV GSHFFGQSSF ATHANVTENS VIVVDKDAPL ELLGPLGCGL NTGAGAVLNE
MRPAAGSSIV VFGTGAVGFA ALMAAAAVSC STIIGVDIHD SRLELARELG ATHTINSSSQ
DLHAELEKIT GGRGVNYALD TTARSSVVRD AADALGKRGV LIAVGAAAPG DEVSFEVGNS
LVKGWTFKTV IEGSAVPQVF IPRLVDLWKQ GKFPFDKLVK TYSLHDINTG FEDSASGAVI
KPVVAY