Gene Information |
Locus tag | Avi_4122 |
Symbol | sucA |
ID | 7388914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 3476471 |
End bp | 3479467 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643652822 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_002550995 |
Protein GI | 222150038 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
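The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Avi_4122 (sucA).
length = gene_length(3476471, 3479467)
print(length)                  # 2997, matching "Gene Length"
print(protein_length(length))  # 998, matching "Protein Length"
```

Under these assumptions the coordinates, the gene length, and the protein length agree with one another. The strand (-) does not affect the length calculation.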
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGGC AAGAAGCCAA CGAGCAGTTT CAGATCACCT CGTTCCTGGA CGGATCGAAT GCGATCTATA TCGAGCAGCT TTATGCGCGT TACGAAGACG ATCCGAATTC GGTATCGCCT GAGTGGCAAT CCTTCTTCAA GGCGCTGGGC GATAATCCAA GCGATGTGAA AAAGGCGGCC AAGGGAGCCT CCTGGCAGCG GAGCAATTGG CCGCTGACGC CTCGTACCGA TCTGGTATCG GCGCTGGATG GCAATTGGGG TCTGGTCGAG AAGGCCATTG AAACCAAGGT CAAGGGCAAG GCAGAAGCCG CAGCCGCAAC CACGGGTAAA CCGGTTTCTG AAACCGATGT GTTGCAGGCG ACCCGTGATA GCGTGCGCGC CATCATGATG ATCCGCGCAT ATCGCATGCG CGGTCATTTG CACGCCAAGC TCGACCCGCT GGGCATTGCT ACGGCGGTTG AAGATTACAA CGAGCTGTCA CCATTGTCTT ACGGCTTTAC CGAAGCCGAT TTTGACCGCA AGATCTTCAT CGATAACGTT CTGGGGCTGG AATATGCCAC CGTGCGTGAG ATGATCGAAA TTCTTGAGCG CACCTATTGC TCGACGCTCG GTTTCGAATT CATGCACATC TCCAATCCGG AAGAAAAATC CTGGATTCAG GAGCGCATCG AAGGCCCGGA TAAAGGCGTC GATTTCACCG TCGAGGGCAA GAAGGCCATT CTGCAAAAGC TGGTTGAGGC CGAAGGCTTT GAGCAGTTTA TCGATGTGAA GTACAAGGGC ACCAAGCGCT TTGGTTTGGA TGGCGGCGAG TCGCTGATCC CGGCGCTGGA GCAGATCATC AAGCGCGGTG GTCAGGAGGG TCTTGAAGAG GTCGTGCTTG GCATGGCGCA TCGTGGCCGT CTGAACGTGC TGACCAATGT GATGCACAAG CCGCATCGCG CCGTGTTCCA CGAGTTCAAG GGCGGTTCGT TCAAGCCTGA TGAAGTTGAA GGCTCCGGCG ACGTGAAGTA CCATCTTGGT GCCTCGTCTG ACCGCGAATT CGATGGCAAC AAGGTTCACT TGTCGCTGAC GGCCAATCCG TCGCATCTGG AAATCGTTAA CCCTGTGGTG ATGGGTAAGG CCCGCGCCAA GCAGGACCAG CTGGCCAAAG TCTGGGAAGG CGACGTGATT CCGCTGAAGG AACGCGCCAA GGTTCTGCCG CTGCTGCTGC ATGGTGACGC TGCCTTTGCG GGTCAGGGCG TTGTAGCTGA AATTCTCGGC CTATCCGGTC TGCGCGGTCA CCGCGTGGCT GGCACGATGC ATGTGATCAT CAACAACCAG ATCGGCTTTA CCACCAATCC GGGCTTCTCG CGCTCCTCGC CTTATCCATC GGATGTGGCC AAGATGATCG AAGCGCCGAT CTTCCACGTC AATGGGGATG ATCCGGAAGC TGTAGTTTAT GCCGCCAAGG TGGCGACTGA ATTCCGCATG AAGTTCCACA AGCCTGTGGT TGTGGACATG TTCTGCTACC GTCGATTTGG CCATAATGAA GGTGATGAGC CTTCGTTCAC CCAGCCGAAG ATGTACAAGG AAATCCGCGC CCATAAGACC GTTGTTCAGG TCTATGGCGA TCGCTTGATT GCTGAAGGCG TGATTACCGA AGGCGATCTC GAAAAGATGA AGGCCGATTG GCGCGCCAAT CTCGAGCAGG AGTTCGAGGC TGGCCAGTCC TACAAGCCTA ACAAGGCTGA CTGGCTGGAT GGCGTCTGGT CCGGTCTTCG CGCTGCCGAC AATGCCGATG AGCAGCGTCG CGGCAAGACA 
GCCATGCCGA TGAAGTCCCT GAAGGAGATC GGCCGCAAGC TGTCGACCAT TCCTGATGGC TTCAAAGCGC ATCGCACCAT CCAGCGCTTC ATGGAAAACC GCGCCCAGAT GATCGAGACC GGCGAGGGTA TCGATTGGGC GATGGCCGAG GCACTGGCTT TCGGTTCGCT GGTCGTTGAA GGCCACAAGA TCCGCCTGTC CGGTCAGGAT TGCGAGCGCG GCACATTCAG CCAGCGTCAT TCGGTGCTCT ACGATCAGGA AAGCGAAGAT CGCTACATTC CGCTGGCCAA TCTGGCCCCC AATCAGGCTC GCTACGAAGT CATCAATTCG ATGCTGTCGG AAGAGGCCGT GCTTGGCTTC GAATACGGCT ATTCGCTGGC ACGTCCGAAT GCGTTGACCC TGTGGGAAGC CCAGTTCGGT GACTTCGCCA ACGGCGCCCA GGTGGTGTTC GACCAGTTCA TCTCGTCGGG TGAACGCAAG TGGCTTCGCA TGTCCGGCCT CGTCTGCCTT CTGCCGCATG GCTATGAAGG TCAGGGACCG GAACACTCTT CGGCTCGTCT GGAGCGCTGG CTGCAAATGT GCGCCGAAGA CAACATGCAG GTTGCCAACG TCACGACGCC GAGCAATTAC TACCATATCC TGCGCCGTCA GGTGAAACGC GACTTCCGCA AGCCGCTGAT CCTGATGACG CCGAAGTCGC TGCTGCGCCA CAAGCGGGCC CAGTCTACCC TGGCGGAAAT GGCAGGCGAA AGCTCTTTCC ATCGCCTGCT GTGGGATGAT GCCGAGATCA TCAAGGACGG TCCGATCAAG CTCCAGAAGG ATGCCAAGAT CCGCCGCGTG GTGATGTGCA GTGGCAAAGT TTATTATGAC CTGCTGGAAG AGCGCGAAAA GCGTGGCATC GACGATGTCT ACCTGCTGCG TATCGAACAG CTCTATCCGT TCCCGGCCAA GGCGCTGATC AATGAGCTCA GCCGCTTCCG CAATGCGGAA ATGGTCTGGT GCCAGGAAGA GCCGAAGAAC ATGGGGGCCT GGTCCTTCAT CGATCCGTAC CTGGAATGGG TGCTGGCGCA TATCGACGCC AAGTATCAGC GCGTGCGCTA TACCGGCCGC CCGGCTGCCG CCTCTCCAGC GACGGGTCTG ATGTCCAAGC ATCTCGCCCA ACTGCAAGCC TTCCTGGAGG ACGCACTCGG CGGCTGA
|
Protein sequence | MARQEANEQF QITSFLDGSN AIYIEQLYAR YEDDPNSVSP EWQSFFKALG DNPSDVKKAA KGASWQRSNW PLTPRTDLVS ALDGNWGLVE KAIETKVKGK AEAAAATTGK PVSETDVLQA TRDSVRAIMM IRAYRMRGHL HAKLDPLGIA TAVEDYNELS PLSYGFTEAD FDRKIFIDNV LGLEYATVRE MIEILERTYC STLGFEFMHI SNPEEKSWIQ ERIEGPDKGV DFTVEGKKAI LQKLVEAEGF EQFIDVKYKG TKRFGLDGGE SLIPALEQII KRGGQEGLEE VVLGMAHRGR LNVLTNVMHK PHRAVFHEFK GGSFKPDEVE GSGDVKYHLG ASSDREFDGN KVHLSLTANP SHLEIVNPVV MGKARAKQDQ LAKVWEGDVI PLKERAKVLP LLLHGDAAFA GQGVVAEILG LSGLRGHRVA GTMHVIINNQ IGFTTNPGFS RSSPYPSDVA KMIEAPIFHV NGDDPEAVVY AAKVATEFRM KFHKPVVVDM FCYRRFGHNE GDEPSFTQPK MYKEIRAHKT VVQVYGDRLI AEGVITEGDL EKMKADWRAN LEQEFEAGQS YKPNKADWLD GVWSGLRAAD NADEQRRGKT AMPMKSLKEI GRKLSTIPDG FKAHRTIQRF MENRAQMIET GEGIDWAMAE ALAFGSLVVE GHKIRLSGQD CERGTFSQRH SVLYDQESED RYIPLANLAP NQARYEVINS MLSEEAVLGF EYGYSLARPN ALTLWEAQFG DFANGAQVVF DQFISSGERK WLRMSGLVCL LPHGYEGQGP EHSSARLERW LQMCAEDNMQ VANVTTPSNY YHILRRQVKR DFRKPLILMT PKSLLRHKRA QSTLAEMAGE SSFHRLLWDD AEIIKDGPIK LQKDAKIRRV VMCSGKVYYD LLEEREKRGI DDVYLLRIEQ LYPFPAKALI NELSRFRNAE MVWCQEEPKN MGAWSFIDPY LEWVLAHIDA KYQRVRYTGR PAAASPATGL MSKHLAQLQA FLEDALGG
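The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any computation. The sketch below is one possible way to do that and to run two simple checks that relate to the record's fields: the GC fraction (compare with "GC content") and whether the sequence ends in a stop codon of translation table 11. The helper names are hypothetical, and the toy input is only the first 18 bases of the gene sequence with a TGA stop codon appended for illustration. It is not the full gene.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence, between 0 and 1."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    seq = normalize(seq)
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input: first 18 bases of the gene sequence plus an appended TGA.
toy = "ATGGCAAGGC AAGAAGCCTG A"
print(len(normalize(toy)))
print(round(gc_fraction(toy), 2))
print(ends_with_stop(toy))
```

To apply this to the record, paste the complete "Gene sequence" field into a string, including the part that continues on the following line. The result of `gc_fraction` can then be compared with the 58% listed above.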