Gene Avi_5431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5431 
Symbol 
ID7381527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp430113 
End bp431513 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content59% 
IMG OID643649034 
Productchlorohydrolase 
Protein accessionYP_002547271 
Protein GI222106480 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.225763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATGA CTTCGAGCGC AATGATCGTC ACCGCCGACA CTCTGTTGAC CATGGATGCG 
AAAAACAGCG TCATCTCTGA TGGCGCGGTT GCTATTGAGG ACGGTCGTAT CCTGGCGGTC
GGTTCGCTGG AGGTGGTGAA GGCCAGCCAT CCGGCTCTGG CGATCAAGAG GATCGACAAT
GCCTTGCTGA TGCCGGGGCT GATCAACGCC CATGCCCATT CCGGCTTCCT GCGCGGCACA
GCGGAACATC TGCCGGTCTG GGACTGGCTG ACCATCCATA TAAACCCGAT GCACCGCGTG
CTTCTGCCGC ATGAAGCCGA GGCGGCGTCC TTTCTGTGCT ACGTCGAATC CGCACTATCC
GGCACGACGA CTGTTGTCGA TATGTGGCGC TATATGGATG GCAGTGCGCG GGCGGCACAG
TCCATCGGCA CGCGCCTGGT TGCTGTTCCG TATGTTGGTG AGCATCCCGA TTACAATTAT
TTCGAAACGC TCGACAACAA TGAGGCGATG ATTGAGACCT GGCATCGCAA GGCGGGGGGC
CGCATCAATG TCTGGGTCGG GCTGGAACAT CTGTTTTATG CCGATGCGGC TGGCCAGCAG
CGGGCCATCG CCATGGCCAA ACAGTATAAC ACCGGCTTTC ACACCCATTG TTCGGAAGCC
GAAGTCGAGG TTGGCGGCTT TATCGACACC TATGGCAAGC GCCCCATGCA TGTTCTGGAG
GATCTCGGCT TCTTCGAGGC TCCACGCACC ATGCTGGCCC ATGCCGTCTG GCTGGATGAG
GCGGAAATCG AGCTGATTGC CAGATACAAT GTCTCGGTCG CCCATAATCC GGTGTCGAAT
ATGAAGCTTG CTTCCGGCAT TGCACCGATT GCCGATATGC TGGCGGCGGG CATTCCGGTC
GGTCTTGGCA CGGATGGCGA AAAGGAAAAC AACAATTTCG ACATGTTCGA GGAGATGAAG
ACCGCCTCCC TGCTCGGCAA ACTGCGCCAC CGCGATGCCG CCGCCATGGA TAGCTGGCAA
TGTCTGCGCA TGGCGACCAT TCTCGGCGCG AGAGCTATCG GTCTTGAGGA TGAAATCGGC
TCTATCGAAG TTGGAAAGCG CGCCGATATC ATTGCGGTGC GCACCGATAC GCCGCGTATG
ACGCCACTGT TTGCCGACGG TCCCTATTTC AATGTGCAGC ACAATCTCGT CCACGCGGTG
CGCGGCGGCG ATGTCGCCAT GACCATGGTG GATGGTCAGG TGATCGTGGA AGATGGCGTG
CTGAAAACCG GCGACGTCAA GGCGATCATT GCCGATATCC ATGGCATGGC GCCTTCGCAT
TTTGCCCGCC GCGCCGCATG GCTTGCCGAA AACGGCGGCG GCACCAAGCA ATGGATCAGC
GAAGCGGAGG GTGTCAAATG A
 
Protein sequence
MSMTSSAMIV TADTLLTMDA KNSVISDGAV AIEDGRILAV GSLEVVKASH PALAIKRIDN 
ALLMPGLINA HAHSGFLRGT AEHLPVWDWL TIHINPMHRV LLPHEAEAAS FLCYVESALS
GTTTVVDMWR YMDGSARAAQ SIGTRLVAVP YVGEHPDYNY FETLDNNEAM IETWHRKAGG
RINVWVGLEH LFYADAAGQQ RAIAMAKQYN TGFHTHCSEA EVEVGGFIDT YGKRPMHVLE
DLGFFEAPRT MLAHAVWLDE AEIELIARYN VSVAHNPVSN MKLASGIAPI ADMLAAGIPV
GLGTDGEKEN NNFDMFEEMK TASLLGKLRH RDAAAMDSWQ CLRMATILGA RAIGLEDEIG
SIEVGKRADI IAVRTDTPRM TPLFADGPYF NVQHNLVHAV RGGDVAMTMV DGQVIVEDGV
LKTGDVKAII ADIHGMAPSH FARRAAWLAE NGGGTKQWIS EAEGVK