Gene Avi_1144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_1144 
SymbolbetA 
ID7386209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp966004 
End bp967659 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content57% 
IMG OID643650615 
Productcholine dehydrogenase 
Protein accessionYP_002548821 
Protein GI222147864 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACC AGGCAGATTT CATCATCATC GGTTCCGGCT CGGCAGGCGC GGCCATGGCT 
TACCGCCTGT CGGAGGATGG CAAGCATACA GTGATCGTGC TGGAATTTGG CGGATCAGAC
ATTGGCCCCT TTGTGCAAAT GCCTGCCGCC CTCGCCTTTC CGATGAATAT GGACCGCTAC
AATTGGGGCT ATGTCACCGA GCCGGAACCA CACCTCAACA ATCGCCGCAT GATTGCGCCG
CGTGGCAAGG TGGTGGGTGG CTCATCCTCC ATCAACGGCA TGGTCTATGT GCGCGGCCAT
GCGGAGGATT TTAACCGTTG GGACGAGCTC GGAGCGACTG GTTGGTCCTA TGCCGATGTG
CTGCCTTACT TCAAGCGCAT GGAGCATTCT CATGGCGGTG AAGAGGGCTG GCGCGGCACC
GATGGTCCCC TGCATGTACG CCGGGGCGAG GTGAAAAATC CGCTGTATCA GGCTTTTATT
GATGCAGGTC AGCAGGCCGG ATTCCCCGTC ACTGAAGATT ACAATGGCCG CCAACAGGAA
GGCTTTGGCC TGATGGAACA GACCAGCTGG CAAGGCCGCC GCTGGTCCAC CGCCAATGCC
TATCTGAAAC CGGCGCTGAA GCGCGACAAT TGCCGCCTGA TCCGCTGTTT TGCCCGCAAG
ATTGTGCTGG ACGGCCGCCG CGCTGTGGGC GTGGAAGTGG AGATTGGCGG CAAGATCGAG
GTGATCCGCG CCAACCGCGA GGTGATTGTT GCCGCCTCCG CCTTTAACTC GCCCAAACTG
CTGCTGCTGT CTGGCATTGG CCCGGCGGCA CATTTGCGCG AGATGGGCAT TGATGTGGTG
GCGGATCGGC CCGGTGTTGG CCAGAACCTG CAAGATCATC TGGAATATTA CCACCAGTTC
AAATCGAAAC TGCCCATCAC CCTGCATTCC AAAAACAACT GGTTCTGGAA AGGCGTGGTC
GGCGCGCAAT GGCTGTTGTT CAAGAAAGGC CTTGGCACCT CCAACCAGTT CGAGGCCGCC
GCCTTTATCC GCTCAAGCGC GGGCGTAAAA TGGCCCGACC TGCAATACCA CTTCCTGCCG
ATTGCCGTGT CCTATGATGG CAAATCGTCA GTGGAGGGCC ATGGTTTTCA GGCCCATGTC
GGCTATAACA TGTCGAAATC GCGTGGCTCT GTGACCTTGC GCTCACCGGA TGTCAAAGAC
GCTCCCGTAT TGCGCTTCAA CTATATGAGC GATGCGGAAG ACTGGGTGAA ATTCCGCCAT
GCCGTGCGGA TCACCCGCGA TATTTTCGCG CAAAAAGCCT TTGATCCCTA TCGGGAATCC
GAAATCGCAC CGGGATCGAA AGTGCAAACC AATGACGAGA TCGACGCCTT CCTGCGCGAA
CATCTGGAAG GCGCCTATCA CCCCTGCGGC ACAGCCAAAA TGGGCAGTAA AGATGATCCA
ATGGCCGTGG TCGATCCCAC TTGCAAGGTG ATTGGCGTCG AAGGCCTACG GGTGGCGGAT
TCATCGATTT TCCCGCATGT CACCTATGGC AATCTCAACG GCCCCTCGAT CATGACCGGC
GAAAAAGCGT CCGATCATAT TCTTGGTCGT GATCCATTGC CCCGCAGCAA TCAGGAGCCC
TGGATTAATC CAGATTGGCA GTTAAGTGAT CGGTAA
 
Protein sequence
MENQADFIII GSGSAGAAMA YRLSEDGKHT VIVLEFGGSD IGPFVQMPAA LAFPMNMDRY 
NWGYVTEPEP HLNNRRMIAP RGKVVGGSSS INGMVYVRGH AEDFNRWDEL GATGWSYADV
LPYFKRMEHS HGGEEGWRGT DGPLHVRRGE VKNPLYQAFI DAGQQAGFPV TEDYNGRQQE
GFGLMEQTSW QGRRWSTANA YLKPALKRDN CRLIRCFARK IVLDGRRAVG VEVEIGGKIE
VIRANREVIV AASAFNSPKL LLLSGIGPAA HLREMGIDVV ADRPGVGQNL QDHLEYYHQF
KSKLPITLHS KNNWFWKGVV GAQWLLFKKG LGTSNQFEAA AFIRSSAGVK WPDLQYHFLP
IAVSYDGKSS VEGHGFQAHV GYNMSKSRGS VTLRSPDVKD APVLRFNYMS DAEDWVKFRH
AVRITRDIFA QKAFDPYRES EIAPGSKVQT NDEIDAFLRE HLEGAYHPCG TAKMGSKDDP
MAVVDPTCKV IGVEGLRVAD SSIFPHVTYG NLNGPSIMTG EKASDHILGR DPLPRSNQEP
WINPDWQLSD R