Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5959 |
Symbol | hutI |
ID | 7381044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 975552 |
End bp | 976772 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643649472 |
Product | imidazolonepropionase |
Protein accession | YP_002547703 |
Protein GI | 222106912 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.946699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACATC GGGATCGCAT TTTCACCAAT GCGCGGTTAG CAACGCTCAA TCCGCAACTA TCCGGCCTCG GCATTATCGA AGATGCCGCG CTGATGGTGC GCGATGGGCA GCTTGTCTAC GCAGGGCCTA TGGCAGAGCT GCCGATTAGC CTGCTGAATG CTGCCGAGGT GACCGATTGC GAGGGCCGCT GGATCACCCC CGGTCTGGTG GATTGCCATA CCCATCTGGT GCATGCTGGC AATCGCGCCC ATGAATTTGA GATGCGCCTA GCGGGTGCCA GCTACGAAGA AATCGCCCGC GCCGGAGGCG GTATTGTCTC ATCCGTCTCC AAGGTGCGGG CGGCGAGTGA GGCGGATTTG CTACGCGAAA CACTGCCGCG TCTGGACGCG CTGCTGGCGG AGGGCGTCAC CACCATCGAG GTTAAATCCG GCTATGGGCT GACGGTGGAA GACGAGCTGA AAATGCTGCG GGCCGCAAAA AAACTCGGCG ATGCGCGCCC CATTGCCATC AGCACCACTT ATCTCGGTGC CCATGCCACG CCTGCCGAGT ATAAGGGCCG CAACGGCGAT TTTATCCGCG AGGTTGTGCT TCCCGGTTTG ACGGCGGCCC ATGCAGAACA GCTTGTTGAT GCCGTCGATG GGTTTTGCGA GGGCATTGCC TTTTCACCCG ATGAAATGCG CGTGGTGTTT GATGCAGCAC AGGCTCTGGG CCTGCCGGTA AAACTGCATG CGGATCAGCT TTCCAATCTA TCCGGTGCAG CGCTTGCTGC GGAGTATGGC GCACTCTCTG CCGACCATCT GGAATATACC GATGCGGCGG GCGCGGATGC CATGGCCAAA GCTGGCACCG TGGCGGTGCT GCTGCCGGGT GCCTTTTACT TTATCCGTGA GACAAAAAAG CCGCCTGTCG ATCTCTTCCG CCAGCATGGC ACCAAAATGG CACTGGCCAC CGACAACAAC CCCGGCACAT CGCCACTCAC CTCGCTGCTG CTGACCATGA ATATGGGTGC CACCCTGTTT GGCATGACGG TGGAGGAATG CATAGCGGGC GTGACCCGTG AGGCCGCCCG CGCATTGGGT CGGCTGGATG AAATCGGCAC GCTGGAAGCC GGAAAATCGG CTGATCTGGC CATCTGGGAT ATTTCTGAAC TGTCTGAGCT TGTCTACCGC ATGGGCTTTA ACCCGCTGCA CCAGCGGGTG TGGCGCGGCA ATGACGCATA A
|
Protein sequence | MTHRDRIFTN ARLATLNPQL SGLGIIEDAA LMVRDGQLVY AGPMAELPIS LLNAAEVTDC EGRWITPGLV DCHTHLVHAG NRAHEFEMRL AGASYEEIAR AGGGIVSSVS KVRAASEADL LRETLPRLDA LLAEGVTTIE VKSGYGLTVE DELKMLRAAK KLGDARPIAI STTYLGAHAT PAEYKGRNGD FIREVVLPGL TAAHAEQLVD AVDGFCEGIA FSPDEMRVVF DAAQALGLPV KLHADQLSNL SGAALAAEYG ALSADHLEYT DAAGADAMAK AGTVAVLLPG AFYFIRETKK PPVDLFRQHG TKMALATDNN PGTSPLTSLL LTMNMGATLF GMTVEECIAG VTREAARALG RLDEIGTLEA GKSADLAIWD ISELSELVYR MGFNPLHQRV WRGNDA
|
| |