Gene Avi_3553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_3553 
Symbol 
ID7388817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp2947534 
End bp2948868 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content60% 
IMG OID643652385 
Productcytosine deaminase-like protein 
Protein accessionYP_002550568 
Protein GI222149611 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGATGACCA GATTGTTTGC CGACATTCCG CAAACCGGGC GGTTTGCCCT GACACGCGCC 
ACCTTGCCTG TCGAAGCAGT TGATGATGTT CCCGCTGGAC CGGTCCGCGA GGGGCTGGTC
AGCGCCGATC TGATTATCAA CGACGGCAAG GTCGAAGCTA TTGTTAAGGT CGGCACCGCA
TCTCGTTACA AGACCGGAGC GGACCTGCCG ATCATCGATC TACGCGATGC CATGGTCTGG
CCGACCTTTA CCGACATGCA CACCCATCTC GACAAGGGCC ATATCTGGCC GCGAAAGCCC
AATCCAAAAG GCGATTTCAT CGGCGCATTG AGCGCCGTGA AGGATGACCG CGAGGCGAAT
TGGTCGGCGG ACGATGTGCG GGCACGGATG GAATTTTCGC TGCGCTGCGC CTATGCCCAT
GGCACCAGCC TGATCCGTAC CCATCTCGAC AGCAGCGCAC CTCAGCACCG GATTTCCTTT
GAGGTGTTTT CGCAGATACG CAAGGAGTGG GCGGGCCGGA TCGACCTTCA GGCGGTCGCC
CTTTTCCCCT TCGACGACAT CACCGATCAG GCGTTTTTTA GAGATTTGCT GGAGGTGCTT
GTTGCGCACA AGGGCATTCT CGGCGGCGTC ACCCAGGTCT CGCCGGATAT CGATCATCGG
CTGGACCTGT TGTTTCGCGC CGCAAGCGAC CACGGGCTCG ACATCGACCT GCATGTCGAT
GAGACCCAGG ATGCTTCCGT GCTGACCTTG AAATCCATTG CCGAGGCCAA GCTGCGCAAT
GGATTTCAAG GCTCGGTCGT GGTCGGCCAT TGCTGTTCAC TGACCCAGCA GAGCGACGAT
ATCGCCAAGG CCACCATCGA CAAGGTCGCG GAAGCCGGGC TTGCCGTCGT GTCACTACCG
ATGTGCAACA TGTATTTGCA GGATCGTCAT CCGGGCCGCA CGCCGCGCCA GCGCGGTGTC
ACCCTGTTTC ACGAACTGGC GGCGGCAGGT GTGCAGACGG CGGTTTCCTC CGACAATACC
CGCGATCCCT TCTATGCTTA TGGCGATCTC GATTGCGTGG AAGTGCTGCG CGAAGCGGTC
AGGATCGTCC ATCTCGATCA CCCGCTGGAC AGCACCGCCC GGATCGTCAC CCGCAGTCCC
GCCGATATTC TCGGACGTCC CGACCATGGC CGTATAAAGG TCGGGGCCAA GGCGGATCTG
GTGCTGTTTT CGGCGAGAAC CTGGAGCGAA TTGCTATCGC GTCCACAGTC TGACCGCACC
GTGTTGCGCT CCGGCCAGGC TATCGACGCG CAGGTGCCTG ACTACCGCGA CCTTGACCCT
TTGATGGAAG ATTGA
 
Protein sequence
MMTRLFADIP QTGRFALTRA TLPVEAVDDV PAGPVREGLV SADLIINDGK VEAIVKVGTA 
SRYKTGADLP IIDLRDAMVW PTFTDMHTHL DKGHIWPRKP NPKGDFIGAL SAVKDDREAN
WSADDVRARM EFSLRCAYAH GTSLIRTHLD SSAPQHRISF EVFSQIRKEW AGRIDLQAVA
LFPFDDITDQ AFFRDLLEVL VAHKGILGGV TQVSPDIDHR LDLLFRAASD HGLDIDLHVD
ETQDASVLTL KSIAEAKLRN GFQGSVVVGH CCSLTQQSDD IAKATIDKVA EAGLAVVSLP
MCNMYLQDRH PGRTPRQRGV TLFHELAAAG VQTAVSSDNT RDPFYAYGDL DCVEVLREAV
RIVHLDHPLD STARIVTRSP ADILGRPDHG RIKVGAKADL VLFSARTWSE LLSRPQSDRT
VLRSGQAIDA QVPDYRDLDP LMED