Gene Avi_5959 details


Gene Information

Locus tag: Avi_5959
Symbol: hutI
ID: 7381044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011988
Strand:
Start bp: 975552
End bp: 976772
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 60%
IMG OID: 643649472
Product: imidazolonepropionase
Protein accession: YP_002547703
Protein GI: 222106912
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
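
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 975552
end_bp = 976772
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
translated_codons = codons - 1

print(span == gene_length_bp)                  # True
print(span % 3 == 0)                           # True
print(translated_codons == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 1221 bp is 407 codons, of which 406 encode amino acids.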


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.946699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACATC GGGATCGCAT TTTCACCAAT GCGCGGTTAG CAACGCTCAA TCCGCAACTA 
TCCGGCCTCG GCATTATCGA AGATGCCGCG CTGATGGTGC GCGATGGGCA GCTTGTCTAC
GCAGGGCCTA TGGCAGAGCT GCCGATTAGC CTGCTGAATG CTGCCGAGGT GACCGATTGC
GAGGGCCGCT GGATCACCCC CGGTCTGGTG GATTGCCATA CCCATCTGGT GCATGCTGGC
AATCGCGCCC ATGAATTTGA GATGCGCCTA GCGGGTGCCA GCTACGAAGA AATCGCCCGC
GCCGGAGGCG GTATTGTCTC ATCCGTCTCC AAGGTGCGGG CGGCGAGTGA GGCGGATTTG
CTACGCGAAA CACTGCCGCG TCTGGACGCG CTGCTGGCGG AGGGCGTCAC CACCATCGAG
GTTAAATCCG GCTATGGGCT GACGGTGGAA GACGAGCTGA AAATGCTGCG GGCCGCAAAA
AAACTCGGCG ATGCGCGCCC CATTGCCATC AGCACCACTT ATCTCGGTGC CCATGCCACG
CCTGCCGAGT ATAAGGGCCG CAACGGCGAT TTTATCCGCG AGGTTGTGCT TCCCGGTTTG
ACGGCGGCCC ATGCAGAACA GCTTGTTGAT GCCGTCGATG GGTTTTGCGA GGGCATTGCC
TTTTCACCCG ATGAAATGCG CGTGGTGTTT GATGCAGCAC AGGCTCTGGG CCTGCCGGTA
AAACTGCATG CGGATCAGCT TTCCAATCTA TCCGGTGCAG CGCTTGCTGC GGAGTATGGC
GCACTCTCTG CCGACCATCT GGAATATACC GATGCGGCGG GCGCGGATGC CATGGCCAAA
GCTGGCACCG TGGCGGTGCT GCTGCCGGGT GCCTTTTACT TTATCCGTGA GACAAAAAAG
CCGCCTGTCG ATCTCTTCCG CCAGCATGGC ACCAAAATGG CACTGGCCAC CGACAACAAC
CCCGGCACAT CGCCACTCAC CTCGCTGCTG CTGACCATGA ATATGGGTGC CACCCTGTTT
GGCATGACGG TGGAGGAATG CATAGCGGGC GTGACCCGTG AGGCCGCCCG CGCATTGGGT
CGGCTGGATG AAATCGGCAC GCTGGAAGCC GGAAAATCGG CTGATCTGGC CATCTGGGAT
ATTTCTGAAC TGTCTGAGCT TGTCTACCGC ATGGGCTTTA ACCCGCTGCA CCAGCGGGTG
TGGCGCGGCA ATGACGCATA A
 
Protein sequence
MTHRDRIFTN ARLATLNPQL SGLGIIEDAA LMVRDGQLVY AGPMAELPIS LLNAAEVTDC 
EGRWITPGLV DCHTHLVHAG NRAHEFEMRL AGASYEEIAR AGGGIVSSVS KVRAASEADL
LRETLPRLDA LLAEGVTTIE VKSGYGLTVE DELKMLRAAK KLGDARPIAI STTYLGAHAT
PAEYKGRNGD FIREVVLPGL TAAHAEQLVD AVDGFCEGIA FSPDEMRVVF DAAQALGLPV
KLHADQLSNL SGAALAAEYG ALSADHLEYT DAAGADAMAK AGTVAVLLPG AFYFIRETKK
PPVDLFRQHG TKMALATDNN PGTSPLTSLL LTMNMGATLF GMTVEECIAG VTREAARALG
RLDEIGTLEA GKSADLAIWD ISELSELVYR MGFNPLHQRV WRGNDA
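
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the method the database used: it assumes the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch ignores), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
# Assumption: table 11 uses these same assignments for amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 21 bases of the gene sequence in this record.
first_bases = "ATGACACATC GGGATCGCAT TTTCACCAAT"
last_bases = "TGGCGCGGCA ATGACGCATA A"

print(translate(first_bases))  # MTHRDRIFTN
print(translate(last_bases))   # WRGNDA
```

The two outputs match the first ten and last six residues of the protein sequence listed in this record; each 60-base line of the gene sequence holds exactly 20 codons, so every line begins in frame.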