Gene Information |
Locus tag | Avi_5129 |
Symbol | |
ID | 7380933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 118171 |
End bp | 119460 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643648786 |
Product | hypothetical protein |
Protein accession | YP_002547023 |
Protein GI | 222106232 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
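The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, both of which match the values in this record. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp / End bp rows of this record.
gene_bp, protein_aa = cds_lengths(118171, 119460)
print(gene_bp, protein_aa)  # 1290 429
```

The result agrees with the Gene Length (1290 bp) and Protein Length (429 aa) rows.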
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.296521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGCA TTACCGATCT CAATGTTTTC GACCTGCGCT TTCCGACATC TGCAAGCCTC GACGGCTCCG ATGCCATGAA CCCGGACCCG GATTATTCAG CGGCATACGT GGTCCTCGAA ACGGATGAGC CTGGCCTGGA AGGCCATGGC CTCACCTTTA CCATTGGCCG CGGCAACGAT ATCTGCTGTA TGGCGATCAA GGCCATGCGC CATCTGGTGG TGGGCGTGGA TCTGGACGAA GTACGGGCTG CGCCTGGCAA GTTCTGGCGA AAGCTGACTG GCGATAGCCA GTTGCGCTGG ATCGGACCCG ACAAGGGCGC CATGCATCTG GCAACAGGCG CTGTCGTCAA TGCCTTCTGG GATGCCATGG CAAAGCAGGC CGGTCTGCCG GTGTGGCAGT TCGTATCGAC CATGTCACCG GAAGAGATCG CCGATATCGT CGATTATCGC TATCTCACCG ATGCGCTGAC CCGCGAGGAA GCTGTCGCGA TTTTACGCAA GGCGGAGGAC GGCAAGGCCG ACCGCATCGC GCTTCTCGAA AGGCAAGGCT ATCCCTGTTA CACAACCTCG GCCGGATGGC TGGGATATGA CGATGACAAG CTGCGCCGTC TGGCGCAGGA CGCCATCGAC CAGGGTTTCA ATCATATCAA GATGAAGGTT GGGCGTGATC TTGAGGATGA CATTCGTCGT CTGACGATTG CCCGCGAGGT GATCGGTCCG GATCGGTATC TGATGGTCGA TGCCAATCAG GTCTGGGAAG TCGGCCAGGC CATCGACTGG ATGAAAAAGC TCGCCTTTGC CAAACCTTTT TTCATCGAGG AGCCAACCAG CCCGGACGAT GTGGCAGGTC ACCGCAAGAT CCGCGATGCC ATTGCGCCGA TCAAGGTCGC AACCGGCGAG ATGTGCCAGA ACCGCATCAT GTTCAAGCAA TTCATCACCG AAGGCGCGAT TGACATCGTC CAGATCGATT CCTGCCGCAT GGGTGGGCTC AACGAAGTTC TCGCCGTACT GCTGCTGGCG GCGAAATATG AAAAGCCTGT CTGGCCGCAT GCGGGCGGTG TTGGGCTGTG TGAATATGTC CAGCATCTTT CGATGATCGA TTATATCGCG GTCTCCGGCA CCAGAGAGGG CCGCGTGATC GAATATGTAG ACCATCTGCA CGAACACTTC ATCGATCCCT GCGTCATCCG CAACGCCGCC TATATGCCGC CATCCCGCCC CGGCTTCTCC ATCGAGATGA AGCCTGAATC AATTGCTGCC TATACGTTCC CCACCTCAAA GGCAGTCTGA
|
Protein sequence | MTRITDLNVF DLRFPTSASL DGSDAMNPDP DYSAAYVVLE TDEPGLEGHG LTFTIGRGND ICCMAIKAMR HLVVGVDLDE VRAAPGKFWR KLTGDSQLRW IGPDKGAMHL ATGAVVNAFW DAMAKQAGLP VWQFVSTMSP EEIADIVDYR YLTDALTREE AVAILRKAED GKADRIALLE RQGYPCYTTS AGWLGYDDDK LRRLAQDAID QGFNHIKMKV GRDLEDDIRR LTIAREVIGP DRYLMVDANQ VWEVGQAIDW MKKLAFAKPF FIEEPTSPDD VAGHRKIRDA IAPIKVATGE MCQNRIMFKQ FITEGAIDIV QIDSCRMGGL NEVLAVLLLA AKYEKPVWPH AGGVGLCEYV QHLSMIDYIA VSGTREGRVI EYVDHLHEHF IDPCVIRNAA YMPPSRPGFS IEMKPESIAA YTFPTSKAV
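The gene and protein sequences are listed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the nucleotide sequence. It relies on two assumptions: the gene sequence is given in coding orientation (it begins with ATG and ends with TGA even though the gene is on the minus strand), and the amino-acid assignments of translation table 11 are the same as those of the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first three blocks of the gene sequence are used as sample input.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplification: alternative start codons (e.g. GTG, TTG), which
    table 11 reads as Met at the first position, are not special-cased.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGACCCGCA TTACCGATCT CAATGTTTTC"
print(translate(first_blocks))  # MTRITDLNVF
```

The translated fragment matches the first block of the listed protein sequence. Running `gc_fraction` on the full gene sequence would give a value to compare against the GC content row.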