Gene Noca_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4005 
Symbol 
ID4598140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4227158 
End bp4228795 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content70% 
IMG OID639778610 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_925189 
Protein GI119718224 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGA CGCCGTACAA GAACGAGGCC GACTACGTGG TGGTCGGCTC CGGCAGCTCG 
GGCGCCGCGA TCGCCGGCCG GTTGGCCCAG TCCGGCGCGA GCGTGATCGT GCTCGAGGCC
GGCAAGAGCG ACGAGCAGTA CCTGGTGAAG AAGCCCGGCA TGATCGGCCC GATGCACTCG
GTGCCCGAGA TCAAGAAGCG CGTCGACTGG GGCTACTACT CCACGCCGCA GAAGCACCTC
CTGGAGCGCA AGATGCCGGT CCCGCGCGGC AAGGTGGTCG GCGGCTCGAG CTCGATCAAC
GGCATGGTCT ACGTGCGCGG CAACCGCGCC AACTACGACT CCTGGGCAGC CGAGGGCTGC
ACCGGCTGGT CGGCCGACGA GGTCAACGCG GCGTACCGGC GCATGGAGGA CTTCGAGGAC
GGCGCCAACG ACTACCGCGG TGCGGGCGGG CCGATCAAGG TGACCCGCAA CGCCGCGCCG
CAGGAGGGCT CCCTGCAGTT CATCCAGGCG ACCTCGGACG TGCTGGGCGT GAAGGTCCTC
GACGACTACA ACGCCGAGTC GCAGGAGGGG GTCAGCCGGA TGCAGCAGAA CGCCGCCGGC
GGGCTGCGCT ACAGCGCGTC GCGGGGCTAC CTCCACCACC TCGACGTACC CACCCTGCAG
CTGCAGACCG AGGTGCTGGT GAGGAAGGTC GTCATCGAGA ACGGGCGGGC GACGGGCGTC
GAGGTCACCG ACAAGAGCGG CAGCCGGCGT ACCGTCCGGG CCGGCAAGGA GGTCATCCTC
TCGGCCGGCT TCGTCGGGTC CGCGCAGCTC CTGATGCTCT CCGGCATCGG CCCCGCCCAG
CACCTGCGCG ACCACGGCAT CGAGGTGCTC GCGGACCTGC CGGTGGGCGA CAACCTGCAC
GACCACATGT TCCACGCGCT GACCTTCCAC GTGACCTCCT CGAAGATGCG CGGGAACGCC
TTCTTCTTCG GCAAGGGCGT CCTCAAGGAG GCGCTGCGCC CCGGCAGGAC GTTCATGGCG
AACTCGGTGT TCGAGGCCGT CGCGTTCCTG CGTACGTCGC AGGCGACCGA CGTGCCCGAC
CTGCAGCTGC ACCTGCTGCC GTGGTCCTAC GTCTCGCCCA ACCAGGACGA GCCGATCCGC
CACGACGTCG ACCCGCGTAC GTCGATCACG CTGCTCTCGA CGCTGATCTA CCCGCGCAGC
CGCGGCACGT TGCGGCTCGC GTCCGACGAC CCGACCACCC CACCGCTGAT CGACTTCCAG
TACCTCGCCG ACCCCGGCGA CCTGGAGGTG CTCGCGGAGG GTTCGGAGAT GGTCCGGGAG
ATCATGGCCG GCGCCGCGTT CGGCGGTGCG GTCAAGGAGG AGATCCATCC GGGGGCCCGC
CTGAAGGGGC AGGAGCTGCG CGACGCGATC CTCAACCGGG CGACCTCGGT CTACCACGGC
GTCGGCACCT GCCGGATGGG CACCGACGAC CTGTCGGTCG TGACCCCCGA CCTCAAGGTC
CGCGGCGTCG AGAACCTCCG GGTCTGCGAC GCCTCGATCA TGCCGTCGAT CACCGGGGGC
AACACCAACG CGCCCGCGAT CATGATCGGT GAGCGCGGCG CGGACCTCGT TCTCGGCACC
GTTCTCGGAA GGGCGTGA
 
Protein sequence
MAKTPYKNEA DYVVVGSGSS GAAIAGRLAQ SGASVIVLEA GKSDEQYLVK KPGMIGPMHS 
VPEIKKRVDW GYYSTPQKHL LERKMPVPRG KVVGGSSSIN GMVYVRGNRA NYDSWAAEGC
TGWSADEVNA AYRRMEDFED GANDYRGAGG PIKVTRNAAP QEGSLQFIQA TSDVLGVKVL
DDYNAESQEG VSRMQQNAAG GLRYSASRGY LHHLDVPTLQ LQTEVLVRKV VIENGRATGV
EVTDKSGSRR TVRAGKEVIL SAGFVGSAQL LMLSGIGPAQ HLRDHGIEVL ADLPVGDNLH
DHMFHALTFH VTSSKMRGNA FFFGKGVLKE ALRPGRTFMA NSVFEAVAFL RTSQATDVPD
LQLHLLPWSY VSPNQDEPIR HDVDPRTSIT LLSTLIYPRS RGTLRLASDD PTTPPLIDFQ
YLADPGDLEV LAEGSEMVRE IMAGAAFGGA VKEEIHPGAR LKGQELRDAI LNRATSVYHG
VGTCRMGTDD LSVVTPDLKV RGVENLRVCD ASIMPSITGG NTNAPAIMIG ERGADLVLGT
VLGRA