Gene Avi_5090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5090 
Symbol 
ID7380903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp83201 
End bp84412 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content59% 
IMG OID643648756 
Productdihydroorotase 
Protein accessionYP_002546993 
Protein GI222106202 
COG category[R] General function prediction only 
COG ID[COG3964] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTCTC CGCACCAAGC CCAGACACCA TTGCTTTTGA AAAACCTCCG CCCTCTGGCC 
TTTGGCGAGG CAACACCGCC GCAGGCCATC GATATTTTGA TTGGCGCGGA TGGCAAGATC
ATCAAGGTCG GACAAAACAT CGCCCTGACA GAAGAGGCCG AAGTCATTGA TGGCAAAGGG
GCCTGGATTT CGCCCGGCTG GGTCGATCTG CATGTGCATA TCTGGCATGG CGGCACGGAT
ATTTCCATTC GCCCATCGGA ATGCGGGGCC GAACGCGGCG TGACCACGCT GGTCGATGCA
GGCTCCGCCG GAGAGGCCAA TTTCCACGGT TTTCGCGAGT ATATCATTGA GCCATCGCGG
GAGCGGATCA AGGCATTTTT GAACCTTGGC TCCATTGGTC TGGTGGCCTG CAACCGCGTG
CCGGAACTGC GCGACATCAA GGATATCGAT CTCGACCGCA TTCTGGAATG CTACGCGGAC
AACAGCGAGC ATATCGTGGG CCTGAAAGTG CGGGCCAGCC ACGTCATCAC CGGATCATGG
GGCGTAACCC CGGTCAAACT CGGCAAGAAA ATCGCCAAAA TCCTGAAAAT TCCGATGATG
GTGCATGTTG GCGAACCGCC AGCGCTCTAT GACGAAGTGC TGGAAATCCT CGGCCCCGGC
GATGTCGTGA CCCACTGCTT CAACGGCAAG GCCGGGTCCA GCATCATGGA AGACGAAGAC
CTTTATAACC TCGCCGAGCG CTGCGCTGGC GAAGGAATCC GGCTGGATAT CGGCCATGGC
GGTGCCTCCT TCTCGTTCAA AGTGGCAGAA GCGGCCATCA AACGGGGGCT TTTGCCCTTT
TCGATCTCCA CCGATCTGCA TGGCCATTCG ATGAATTTTC CGGTCTGGGA TCTGGCCACC
ACCATGTCCA AACTGCTCTC CGTGGGCATG CCGTTTGAGA AGGTGGTGGA GGCCGTGACC
CATGCACCAG CCTCCGTTAT CCGCCTGTCG ATACAGGATC GGCTGGTGCC GGGAGCGCGG
GCCGATTTCA CCATTTTCGA TCTGGTCGAT TCCGATCTCG AAGCCACGGA TTCCAACGGC
GATGTTGCCC GCCTGACCCA GCTGTTTGAG CCGCGCCATG CCGTGATTGG CCGCGAGGCC
ATTGCCGCCA GCCGCTACAT CCCTCGCGCC CGCAAGCTGG TGCGCCACAG CCATGGGTAC
TCGTGGCGCT GA
 
Protein sequence
MLSPHQAQTP LLLKNLRPLA FGEATPPQAI DILIGADGKI IKVGQNIALT EEAEVIDGKG 
AWISPGWVDL HVHIWHGGTD ISIRPSECGA ERGVTTLVDA GSAGEANFHG FREYIIEPSR
ERIKAFLNLG SIGLVACNRV PELRDIKDID LDRILECYAD NSEHIVGLKV RASHVITGSW
GVTPVKLGKK IAKILKIPMM VHVGEPPALY DEVLEILGPG DVVTHCFNGK AGSSIMEDED
LYNLAERCAG EGIRLDIGHG GASFSFKVAE AAIKRGLLPF SISTDLHGHS MNFPVWDLAT
TMSKLLSVGM PFEKVVEAVT HAPASVIRLS IQDRLVPGAR ADFTIFDLVD SDLEATDSNG
DVARLTQLFE PRHAVIGREA IAASRYIPRA RKLVRHSHGY SWR