Gene Avi_5410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5410 
Symbol 
ID7381510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp408372 
End bp409754 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content63% 
IMG OID643649018 
Productcytosine deaminase-like protein 
Protein accessionYP_002547255 
Protein GI222106464 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0215382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGACA GTTCCGTCCG CACGCTGGCA CGTACCAGCT TTACCGGCCA TGGCCGCATT 
GTTATCCGTC AGGCCCGCGT GCCAACGGTC TGTCTTGCGG ACGGTGTGCC GCCGGACAGC
GCTCTCGACC GCGACGGCTG TGCTCTGGTG GATATCCTGC TGGACGACGG ACGTGTTGCA
GCCGTGCTGC CAGCGGCAAG CGGGGTGACC GGCGGTGGCA TCGATCTGGA AGGCCGCCAT
CTCTGGCCGT TGATGATCGA CATGCATGTC CATCTCGACA AGGGCCATAC GATCAGCCGG
GCACCAGCCT CCGATGGCAC GCATCCCAGC GCTCGTGCGG CGACGACAGC TGACCGGCAG
GCCCATTGGC GTCACGACGA TCTGGTCCGG CGGATGGAAT TCGGTCTTGC CTGCGCGGAT
GCGCACGGCG TTGCCGCGCT CCGCACCCAT CTCGATAGCC ATGCGGGGCA GGCGGAAACC
ACCTGGGCGG CATTTGATGA GGTGCAGGCC CGTTGGTCGG GGCGCATTGC GTTGCAGGCG
GTCGGCCTCG TGCCGCTGGA TGCCTATCGC ACCGACCATG GGAAGCAGCT TGCCGACCTG
ATCGCGCGTC AGGGCGGGCT ATTGGGCGGG GTAACGCGCG CCTCGGGTGG CACTCATGGT
ATCGGGAAAA ATGGCACCGG GCTTGATGAT ATCGATGCCT TGCTCGACAG CCTTTTTCTG
CTGGCCCGCG AGCGTGATCT CGATATCGAC CTGCATGTGG ATGAAGCTGA AAAGGCCGAT
GCCTTGCCGC ATGTGGCCCG CGCTGCCATC CGCCACGGCT ATGAGGGTCG CGTTACCTGC
GGCCATTGCT GTTCGCTGGC TTTGTTCAGC GAGACGGAAA TCCGCGAGCG GATCGCGCTT
CTGGCCGATG CCGGACTGTC TATCGTCACG CTTCCGACCG TCAATATGTA TTTGCAGGGC
CGGGCGCAGG GTATAACCCC GCGTTGGCGT GGTGTTACGC CTGTCAAGGA GCTGCGCGCC
GCCGGTATCC GCGTCGCGGT TGCTGGTGAC AATTGCCGCG ATCCGTTTTT TGCCTATGGC
GATCATGATA TGGTCGATAC CTGGCGCCAA TCGGTGCGCA TTCTGCATCT CGACCATCCC
TATGACGATG CCGTGGCACT GGCGACCACC CAACCCGCCG CGATGACCGG TTTTTCCACT
GGAACCATCG GGGCCGGGCG TCCGGCGGAT CTGATGATCT TTGAGGCCTG GAGCATGGAC
CAGGTCATTG CCCGCCCGCA AACCGACCGG GTGATCGTGC GGGCAGGCAG GGTCAGTGAG
GCAGTGTTGC CATCCTATCG GGAGATCGAA TTGCCCTTTC TCACCCCATC ACCCTCATCC
TGA
 
Protein sequence
MIDSSVRTLA RTSFTGHGRI VIRQARVPTV CLADGVPPDS ALDRDGCALV DILLDDGRVA 
AVLPAASGVT GGGIDLEGRH LWPLMIDMHV HLDKGHTISR APASDGTHPS ARAATTADRQ
AHWRHDDLVR RMEFGLACAD AHGVAALRTH LDSHAGQAET TWAAFDEVQA RWSGRIALQA
VGLVPLDAYR TDHGKQLADL IARQGGLLGG VTRASGGTHG IGKNGTGLDD IDALLDSLFL
LARERDLDID LHVDEAEKAD ALPHVARAAI RHGYEGRVTC GHCCSLALFS ETEIRERIAL
LADAGLSIVT LPTVNMYLQG RAQGITPRWR GVTPVKELRA AGIRVAVAGD NCRDPFFAYG
DHDMVDTWRQ SVRILHLDHP YDDAVALATT QPAAMTGFST GTIGAGRPAD LMIFEAWSMD
QVIARPQTDR VIVRAGRVSE AVLPSYREIE LPFLTPSPSS