Gene Gdia_2223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2223 
Symbol 
ID6975652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2465439 
End bp2467034 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content65% 
IMG OID643391751 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_002276594 
Protein GI209544365 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.487572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.417914 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAA GCTGTGACTA TCTCGTCATC GGCGGGGGGT CGGCGGGCTG CGTCATGGCG 
GCGCTGCTGT CGGAAAATCC GGCGGCGCGC GTATGCATGA TCGAGGCCGG CGGCCCCGAT
ACCAATCCCC TGATCCACAT CCCGATCGGC TTCGCCAAGA TGACGACGGG ACCGCTGACC
TGGGGGCTGG CCACTGCGCC GCAGAAGCAC GCCAACAATC GTGAAATCCC TTATGTGCAG
GCCAAGGTCC TGGGCGGTGG GTCCTCGATC AATGCCGAAG TCTTCACGCG CGGCGTTCCG
TCGGATTATG ACCGCTGGGT GGAGGAAGGC GCCGAGGGCT GGGCCTTCAA GGACATCCAG
AAATACCTGA TCCGGTCCGA GGGCAATACG GCGCTGTCGG GCGAATGGCA TGGCACGAAC
GGCCCGCTGG GCGTGTCCAA CCCGACCTCG CCCAATCCGC TCAGCCTCGC CTTCGTGCAG
AGCTGCCAGG AATACGGCAT TCCGTACAAT CCCGACTTCA ACGGCCCCAG GCAGGAAGGG
GCCGGATTCT ACCAGTTGAC GGTCCGCAAC AGCCGGCGCT GCTCGGCGGC GGTCGGCTAT
CTGCGTCCGG CGCGCAAGCG CGCCAACCTG CATGTCATCA CCAGGGCGCA GGTCCTGCGC
ATCGCCTTCG AGGGCAAGCG CGCGAAGGGC GTCGTCTATG CGGTGGATGG CCAGGTCCGG
GAAGTGCGGG CGGAACAGGA AGTCATCGTC ACCTCCGGCG CCATCGGCAC GCCGAAACTG
CTGATGCTGT CGGGCATCGG GCCGGCCGCG CACCTGCAGG CCCATGACGT TCCGGTGGTG
CATGACCTGC CGGGCGTCGG CCAGAACCTG CAGGACCATT TCGGCGTGGA TATCGTCGCC
GAACTGAAAG ATCACGAAAG CTACAACCGG TACAACAAAT ATCACTGGGC GGCGTGGGCC
GGCCTGCAAT ACGCGCTGTT CCGCTCGGGT CCGCTGGCGT CCAACGTCGT GGAAGGCGGC
GCGTTCTGGT ATGCGGACCG CAACGCGCGC ACGCCCGACC TGCAATTCCA CTTCCTGGCC
GGCGCGGGGG CGGAAGCCGG GGTGGTCTCG GTGCCGAAGG GCGCGTCCGG CATTACCCTC
AACAGCTACA CGCTGCGCCC GAAATCGCGT GGCACGGTCA CGCTGCGGTC GTCCGACCCC
CGGGACAACC CGATCGTCGA TCCGAACTTC CTGGCCGACC CCGACGACCT GCGCATCTCG
GCCGAAGGCG TGAAGATCAG CGTGGAGATG TTCCGCCAAC CGTCGCTGCA GAAATACATC
AAGTCGATCA ACCTGTTCGA CGAGATCCGG CCGACGGCCC GCACCTACGA GGACTACACC
CGGCAGAACG GCCGGACATC CTATCACCCC ACCTGCACCT GCAAGATGGG CAAGGACCCG
ATGGCGGTGG TCGATTCGCA GCTTCGCATC CACGGGCTGG ACGGCATCCG CATCTGCGAC
AGCTCGGTCA TGCCGTCGCT GATCGGATCG AATACCAACG CGCCGACGAT CATGATCGCC
GAGCGCGCCG CCGACCTGAT CCGGGGCAAT GCCTAG
 
Protein sequence
MTESCDYLVI GGGSAGCVMA ALLSENPAAR VCMIEAGGPD TNPLIHIPIG FAKMTTGPLT 
WGLATAPQKH ANNREIPYVQ AKVLGGGSSI NAEVFTRGVP SDYDRWVEEG AEGWAFKDIQ
KYLIRSEGNT ALSGEWHGTN GPLGVSNPTS PNPLSLAFVQ SCQEYGIPYN PDFNGPRQEG
AGFYQLTVRN SRRCSAAVGY LRPARKRANL HVITRAQVLR IAFEGKRAKG VVYAVDGQVR
EVRAEQEVIV TSGAIGTPKL LMLSGIGPAA HLQAHDVPVV HDLPGVGQNL QDHFGVDIVA
ELKDHESYNR YNKYHWAAWA GLQYALFRSG PLASNVVEGG AFWYADRNAR TPDLQFHFLA
GAGAEAGVVS VPKGASGITL NSYTLRPKSR GTVTLRSSDP RDNPIVDPNF LADPDDLRIS
AEGVKISVEM FRQPSLQKYI KSINLFDEIR PTARTYEDYT RQNGRTSYHP TCTCKMGKDP
MAVVDSQLRI HGLDGIRICD SSVMPSLIGS NTNAPTIMIA ERAADLIRGN A