Gene Noca_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2097 
Symbol 
ID4595542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2240408 
End bp2241820 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content72% 
IMG OID639776700 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_923293 
Protein GI119716328 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00213522 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGC GCTACGACGT GATCGTGGTC GGGGCCGGGA CCTCGGGCCT CAACCTGGCC 
CGCGAGCTGG CGGCCGGCGG CCTGCACTGC CTGGTCCTGG AGGCGGGTGG CCGCTACGAC
CGGCACACCT ACCCCCGCAC CGAGGTCGAC GGCTCGGCCC AGCTGTTCTG GGGCGGCGGC
CTGGAGCTCA ACGCCGATGC GTCGCTGGCG ATCCTGCGCC CGAAGGTGGT CGGGGGCGGC
TCGATCGTGA ACCAGGCGCT GATGGACCGG TTCGACGACG TCGCGCTCGA CGACTTCCGG
GCGGCGAGCG GCGTCGACCT GTTCACCGAG ACCGCGATGG CGCCGTACTA CGACCGTGCC
GAGGCCACCA TCTGCCTGCA GACGGTGCCC GAGCGGCACC GCAACGGCAA CGCGACGATC
TTCGCCGAGG GCTTCAGCCG CAACGGCTAC CGGCACGCGC CGCTGCGGCG CGCCCAGTCC
GACTGCCGCT TCGAGGACGG CAACTCCTGC ATCGAGTGCC TCTCGGGCTG CCGGATCGAC
TCCAAGCAGT CCACCGCGAT CACGGCCCTG CCCGCGGCCG AGCGGCACGG CGCCGTGCTG
CTCGCCGACG TCGAGGTGAC CCGGGTCGCC GAGCGTCCCG ACCGGGTGAG CGTGACCGGC
CTGGTCGGCA AGCCCGGCAG CACGCGCACC GAGCAGACCT GGACGGCCGC CCGGCTGGTG
CTGGCCGCCG GGGCGATCGG GAACTCGCGG CTGCTGCTGT CCTCCGGGTT CGGCGCGGAG
CTGCCCGCGC TGGGGCGGAA CTTCTTCACC CACCCGCAGT ACATGAACTT CGGCGTCTTC
GACGAGCCGG TCCGGGCGCA CTCGGGACCG CTGCAGAACT ACAAGTCCGC CGATCCAGGG
TTCCGCCGAC AGGGGTTCAA GCTCGAGAAC GTGTTCGCCG GGCCGTCGTC CATCGCGATG
CTGATGCCGG GGTTCGGCGC GGCGCACCTG GCGCTGATGC GCCGCTACGA CCACCTGGGG
TGCATCGAGG TGTGCGTGCG CGACACCACC CCGGGCCGGA TCCGGCTCAA CCGCAAGGGC
GCCGTGGTGA TCGAGAAGCG GCTCGGCGCC GAGGACCTGC GTCGCCGCGA CGCCGGCGCC
GCGGCGATCC GGAACATCTT CTTGTCCATG GGTGCGCGCC GGCTGGTCGA GGGCGACCTG
GGGATCGGGC TGCACCTGAT GGGCGGCTGC GCGATCGGGA CCGATCCTGC CCGCTCGGTC
GTCGACCCCG ACTTCACCCT GCACGGCAGC CGGCGCATCC ACGCCGCCGA CTCGAGCGTG
TTCCCGAACG CACCGGGGAT CAACCCGGCG CTGACCATCG CCGCGCTCTC GATCCGGGCC
GGCGAGTCGA TCCTGGCCGC GGCGCGGAGA TGA
 
Protein sequence
MTERYDVIVV GAGTSGLNLA RELAAGGLHC LVLEAGGRYD RHTYPRTEVD GSAQLFWGGG 
LELNADASLA ILRPKVVGGG SIVNQALMDR FDDVALDDFR AASGVDLFTE TAMAPYYDRA
EATICLQTVP ERHRNGNATI FAEGFSRNGY RHAPLRRAQS DCRFEDGNSC IECLSGCRID
SKQSTAITAL PAAERHGAVL LADVEVTRVA ERPDRVSVTG LVGKPGSTRT EQTWTAARLV
LAAGAIGNSR LLLSSGFGAE LPALGRNFFT HPQYMNFGVF DEPVRAHSGP LQNYKSADPG
FRRQGFKLEN VFAGPSSIAM LMPGFGAAHL ALMRRYDHLG CIEVCVRDTT PGRIRLNRKG
AVVIEKRLGA EDLRRRDAGA AAIRNIFLSM GARRLVEGDL GIGLHLMGGC AIGTDPARSV
VDPDFTLHGS RRIHAADSSV FPNAPGINPA LTIAALSIRA GESILAAARR