Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_1144 |
Symbol | betA |
ID | 7386209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 966004 |
End bp | 967659 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643650615 |
Product | choline dehydrogenase |
Protein accession | YP_002548821 |
Protein GI | 222147864 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACC AGGCAGATTT CATCATCATC GGTTCCGGCT CGGCAGGCGC GGCCATGGCT TACCGCCTGT CGGAGGATGG CAAGCATACA GTGATCGTGC TGGAATTTGG CGGATCAGAC ATTGGCCCCT TTGTGCAAAT GCCTGCCGCC CTCGCCTTTC CGATGAATAT GGACCGCTAC AATTGGGGCT ATGTCACCGA GCCGGAACCA CACCTCAACA ATCGCCGCAT GATTGCGCCG CGTGGCAAGG TGGTGGGTGG CTCATCCTCC ATCAACGGCA TGGTCTATGT GCGCGGCCAT GCGGAGGATT TTAACCGTTG GGACGAGCTC GGAGCGACTG GTTGGTCCTA TGCCGATGTG CTGCCTTACT TCAAGCGCAT GGAGCATTCT CATGGCGGTG AAGAGGGCTG GCGCGGCACC GATGGTCCCC TGCATGTACG CCGGGGCGAG GTGAAAAATC CGCTGTATCA GGCTTTTATT GATGCAGGTC AGCAGGCCGG ATTCCCCGTC ACTGAAGATT ACAATGGCCG CCAACAGGAA GGCTTTGGCC TGATGGAACA GACCAGCTGG CAAGGCCGCC GCTGGTCCAC CGCCAATGCC TATCTGAAAC CGGCGCTGAA GCGCGACAAT TGCCGCCTGA TCCGCTGTTT TGCCCGCAAG ATTGTGCTGG ACGGCCGCCG CGCTGTGGGC GTGGAAGTGG AGATTGGCGG CAAGATCGAG GTGATCCGCG CCAACCGCGA GGTGATTGTT GCCGCCTCCG CCTTTAACTC GCCCAAACTG CTGCTGCTGT CTGGCATTGG CCCGGCGGCA CATTTGCGCG AGATGGGCAT TGATGTGGTG GCGGATCGGC CCGGTGTTGG CCAGAACCTG CAAGATCATC TGGAATATTA CCACCAGTTC AAATCGAAAC TGCCCATCAC CCTGCATTCC AAAAACAACT GGTTCTGGAA AGGCGTGGTC GGCGCGCAAT GGCTGTTGTT CAAGAAAGGC CTTGGCACCT CCAACCAGTT CGAGGCCGCC GCCTTTATCC GCTCAAGCGC GGGCGTAAAA TGGCCCGACC TGCAATACCA CTTCCTGCCG ATTGCCGTGT CCTATGATGG CAAATCGTCA GTGGAGGGCC ATGGTTTTCA GGCCCATGTC GGCTATAACA TGTCGAAATC GCGTGGCTCT GTGACCTTGC GCTCACCGGA TGTCAAAGAC GCTCCCGTAT TGCGCTTCAA CTATATGAGC GATGCGGAAG ACTGGGTGAA ATTCCGCCAT GCCGTGCGGA TCACCCGCGA TATTTTCGCG CAAAAAGCCT TTGATCCCTA TCGGGAATCC GAAATCGCAC CGGGATCGAA AGTGCAAACC AATGACGAGA TCGACGCCTT CCTGCGCGAA CATCTGGAAG GCGCCTATCA CCCCTGCGGC ACAGCCAAAA TGGGCAGTAA AGATGATCCA ATGGCCGTGG TCGATCCCAC TTGCAAGGTG ATTGGCGTCG AAGGCCTACG GGTGGCGGAT TCATCGATTT TCCCGCATGT CACCTATGGC AATCTCAACG GCCCCTCGAT CATGACCGGC GAAAAAGCGT CCGATCATAT TCTTGGTCGT GATCCATTGC CCCGCAGCAA TCAGGAGCCC TGGATTAATC CAGATTGGCA GTTAAGTGAT CGGTAA
|
Protein sequence | MENQADFIII GSGSAGAAMA YRLSEDGKHT VIVLEFGGSD IGPFVQMPAA LAFPMNMDRY NWGYVTEPEP HLNNRRMIAP RGKVVGGSSS INGMVYVRGH AEDFNRWDEL GATGWSYADV LPYFKRMEHS HGGEEGWRGT DGPLHVRRGE VKNPLYQAFI DAGQQAGFPV TEDYNGRQQE GFGLMEQTSW QGRRWSTANA YLKPALKRDN CRLIRCFARK IVLDGRRAVG VEVEIGGKIE VIRANREVIV AASAFNSPKL LLLSGIGPAA HLREMGIDVV ADRPGVGQNL QDHLEYYHQF KSKLPITLHS KNNWFWKGVV GAQWLLFKKG LGTSNQFEAA AFIRSSAGVK WPDLQYHFLP IAVSYDGKSS VEGHGFQAHV GYNMSKSRGS VTLRSPDVKD APVLRFNYMS DAEDWVKFRH AVRITRDIFA QKAFDPYRES EIAPGSKVQT NDEIDAFLRE HLEGAYHPCG TAKMGSKDDP MAVVDPTCKV IGVEGLRVAD SSIFPHVTYG NLNGPSIMTG EKASDHILGR DPLPRSNQEP WINPDWQLSD R
|
| |