Gene Avi_5414 details

Gene Information

Locus tag: Avi_5414
Symbol:
ID: 7381514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011988
Strand:
Start bp: 415368
End bp: 416759
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 60%
IMG OID: 643649022
Product: chlorohydrolase family protein
Protein accession: YP_002547259
Protein GI: 222106468
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02967] guanine deaminase
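
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed 1392 bp) and that the final codon of this unspliced CDS is a stop codon that is not counted as a residue. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(415368, 416759)
print(length)                     # 1392
print(protein_length_aa(length))  # 463
```

Both results agree with the Gene Length and Protein Length fields of this record.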


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCATGACC AATACCGCGT ACCAGCCATG AGCGATTTCA GAAACCGGGT ACTGATCGCC 
AGCGGTTTTC ATGCCCCGGT GGCTGGCGAA ATCGATATGC TGACTGATTG CCTGATCGCT
ATTGATGCGG AGGGCATGAT CCTCTCCGTG CAGCGTCCCG GCGATGACGG ATATGCATTG
ACGAAAGCCG AAGCCGACCG CCAGGGGCGG CTATCACGCT TGCCTGCGGG TTGCCTGCTC
CTGCCGGGCC TGGTGGATTG CCATGTGCAT GCGCCACAAT ATCCTCAGCT TGGCACAGCG
CTGGATGTGC CGCTTGAAAC CTGGCTGCAT GCCCATACCT TCCCGCTGGA AGCCCGCTAT
GCGGACTTAG CCTATGCAAA GCGAGTCTAT GGCCTGCTGG TTGATGATCT GCTGGCCAAT
GGAACGACGA CGGCGCTGTA TTTCGCCACC ATCCATCAGG ATGCGACCCG CATTCTCGTC
GATACCTGCC TTGAAAAAGG CCAGCGCGCC CTGATCGGCA AAGTCGCCAT GGACAATGCC
GAACAATGCC CGGACTATTA CCGCGACGCC TCACCGGATG CGGCTTTGCA GGGAACACAA
GCGCTGATCG ACTATATCAG CACTCATCCC GACAACACGG CCTCCCGCGT CTGGCCGGTG
GTGACTCCAC GGTTCATTCC GGCCTGCACG GATGCGACGC TGGAAGGCTT GGGGGCTATG
GCCCAAGACT GTGGCTGCCA TGTGCAGACC CACTGTTCGG AAAGCGATTG GGAACACGCC
TATGTGCTAT CGCGCCACGG CATGACGGAT GCGATGAGCC TTGATCGCTT CGGCCTTCTC
ACCCGCCGCA GCATGCTTGC CCATGCCAAT CTGCTGACTG CTGATGATAT GGACCTGATC
AAGCTGCGAC AGACAGCGGT GGCGCATTGC CCGCTCTCCA ACGGCTATTT CGCTGGTGCG
GTCTTTCCCC TGCGGGCTGC CCTGGAAAAG GGCCTGCATG TCGGGCTGGG CAGCGATATT
TCCGGCGGAC CGAGCGCCTC GCTTCTGGAC AATATGCGCG CCGCCATTCT CGTCTCCCGC
ATGCTGGAAA CCGGGGTCGA TCCGGGCCTC CCGCCGGAAA AGCGCGCAAG CGGCACGAAG
GCCCGCATCG ACTTTCGCCA CGCCTTCCAC GTCGCCACCG CTGGCGGCGG CAAGGCATTG
GATCTGCCCA TCGGCCAATT TGCCCCGGGC TATCGCTTCG ATGCCATCGT CGTTGATCCG
CAGGCCGCCC AAGGCACGCT GCGGTTCTGC GAGAGTGATG AAATGACTGA GACCCTGCTC
CAGAAAATCG TCTTTACTGC ATCGCGTGCC AATATCTCTG CCGTGTTTAT CGACGGATGC
AAGGTGGCCT GA
 
Protein sequence
MHDQYRVPAM SDFRNRVLIA SGFHAPVAGE IDMLTDCLIA IDAEGMILSV QRPGDDGYAL 
TKAEADRQGR LSRLPAGCLL LPGLVDCHVH APQYPQLGTA LDVPLETWLH AHTFPLEARY
ADLAYAKRVY GLLVDDLLAN GTTTALYFAT IHQDATRILV DTCLEKGQRA LIGKVAMDNA
EQCPDYYRDA SPDAALQGTQ ALIDYISTHP DNTASRVWPV VTPRFIPACT DATLEGLGAM
AQDCGCHVQT HCSESDWEHA YVLSRHGMTD AMSLDRFGLL TRRSMLAHAN LLTADDMDLI
KLRQTAVAHC PLSNGYFAGA VFPLRAALEK GLHVGLGSDI SGGPSASLLD NMRAAILVSR
MLETGVDPGL PPEKRASGTK ARIDFRHAFH VATAGGGKAL DLPIGQFAPG YRFDAIVVDP
QAAQGTLRFC ESDEMTETLL QKIVFTASRA NISAVFIDGC KVA
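
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction calculation and a codon split; the helper names are hypothetical. Note that the gene begins with TTG rather than ATG. TTG is one of the alternative initiation codons in translation table 11, and as an initiator it is translated as methionine, which is consistent with the protein sequence starting with M.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete triplets, ignoring any trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First line of the gene sequence as listed in this record.
first_line = "TTGCATGACC AATACCGCGT ACCAGCCATG AGCGATTTCA GAAACCGGGT ACTGATCGCC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(codons(seq)[:3])  # ['TTG', 'CAT', 'GAC']
print(round(gc_fraction(seq), 2))
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field of the record.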