Gene Avi_5847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_5847 
Symbol 
ID7380627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011988 
Strand
Start bp866477 
End bp868102 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content64% 
IMG OID643649377 
Producthypothetical protein 
Protein accessionYP_002547614 
Protein GI222106823 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2
[COG2049] Allophanate hydrolase subunit 1 
TIGRFAM ID[TIGR00370] conserved hypothetical protein TIGR00370
[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0879773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTTC TCCCCGTCAG CCTGACGACG ATGCTGGTCG AGCTTGCCAA TCTCGATGAG 
ACCTTGGCTC TGTTCGCCTC GCTTCAGGCA AGCCCCATCC CCGGCATCGA TGAAATGGTA
CCCGCCGCTC GGACCTTGAT GATCCGGTTT CGCCCGCAGA CCATCAGCGC GCAAGCCCTG
GCGGCTGAGG TTAGCACCCG CGACCTGTCC GCCAAACTGG CCCCATCCGA TCATCTGGTC
GAAATACCCG TAGACTATGA CGGCGAAGAT CTGGCCGACG TGGCGGAACT GACCGGGCTT
GCCGTGGAAG AGGTTATCCG CCGCCATACG GAAAGCACGT TTACGGTTGC TTTTTGTGGC
TTTGCGCCGG GCTTCGGCTA TCTCGTCGGC GGCGACCCGG CCTTGCATGT GCCGCGCCGC
AAAAGCCCGC GCACCCGCAT TCCCGCTGGC GCCGTGGCAC TGGCAGGTGC CTTCAGCGGT
GTCTATCCGC AGGCCAGCCC CGGCGGTTGG CAAATCATCG GCGTGACACC GGAGAAAATG
TGGGATCTCA GCCGCGATCC GCCAGCACTG TTTCAGCCCG GCTATCAGGT GCGCTTCACC
GATATGGCAA AAGCGGTCCA TCCGGTTGTT ATTCCTGCAA GCGATGAAGC CCCTAGCAAT
CCGGCTGATG GGCTTTCCGA GACGAATGGC GCGGGGCATT TGACCGTGCT TGCCGCCCCC
ATGCCTGCGG TGTTCCAGGA CCTTGGCCGT CTCGGCCAGA CCGGCCAGGG CGTTTCCGCC
TCCGGTGCTC TGGACCAGGG AGCCTTGAAG GCAGCCAATC GCGTGGTGGG CAATCCATCG
GGCCTCCCCT GTCTGGAAAT CACCCTTGGT GGCTTTTCTT TTGAAAGCGA TAGCCGCGCC
GTCATCGCGC TGACCGGTGC GCCCTGCCCG GTCGCTATTC GGGATGCCTC GGGTCGGGTG
ATGCAAGCTG AAACCTATCA GCCGATTGCG CTGGAACCCG GCGATATCGT CAGCCTTGGC
CAGCCTCCAC AGGGCATGCG CAGCTATCTT GCCGTGCGCG GTGGTTTTGC GGTCCAGCCG
GTGCTGGGCA GCTATGCCAC CGATACGCTG GCCGTGGTCG GACCTGAGCC GGTGACGGCA
GGCACGGTTC TGACGCTGAA GGGCAACAGC GAGGGTCTCG CCGCCGTTTC CCTGCATGAA
TCTCCACCGC AAGATCTGCC GAGCGCAGGC GATATCGTCA CGCTGGACGT CGTGCTTGGG
CCGCGCACCG ATTGGTTCAC CGAAAAAGGT CTGGCGACGC TGACGGACCA GCTTTGGCAG
GTGACGCCGC AATCCAGCCG CGTCGGTATC CGGCTTGCGG GCGACGTGCC GCTGGAGCGG
ATCGACAGCG CCGAACTGCC AAGCGAAGGC ACCGCCACCG GTGCCATTCA GGTGCCGCAT
AGCGGCCAGC CGGTGCTGTT TCTGGCAGAC CATCCGCTGA CCGGTGGCTA TCCCGTCATC
GGCACGGTTG CCGAATATCA TCTGGATCTT GCGGGACAGA TCCCGATCAA CGCCCGCATC
CGCTTCCGGC CCATTGCCCC TTTCGCAGAC ATCACCCCTG TCGATGCCCC AGCCGCGAAC
GCATAG
 
Protein sequence
MRFLPVSLTT MLVELANLDE TLALFASLQA SPIPGIDEMV PAARTLMIRF RPQTISAQAL 
AAEVSTRDLS AKLAPSDHLV EIPVDYDGED LADVAELTGL AVEEVIRRHT ESTFTVAFCG
FAPGFGYLVG GDPALHVPRR KSPRTRIPAG AVALAGAFSG VYPQASPGGW QIIGVTPEKM
WDLSRDPPAL FQPGYQVRFT DMAKAVHPVV IPASDEAPSN PADGLSETNG AGHLTVLAAP
MPAVFQDLGR LGQTGQGVSA SGALDQGALK AANRVVGNPS GLPCLEITLG GFSFESDSRA
VIALTGAPCP VAIRDASGRV MQAETYQPIA LEPGDIVSLG QPPQGMRSYL AVRGGFAVQP
VLGSYATDTL AVVGPEPVTA GTVLTLKGNS EGLAAVSLHE SPPQDLPSAG DIVTLDVVLG
PRTDWFTEKG LATLTDQLWQ VTPQSSRVGI RLAGDVPLER IDSAELPSEG TATGAIQVPH
SGQPVLFLAD HPLTGGYPVI GTVAEYHLDL AGQIPINARI RFRPIAPFAD ITPVDAPAAN
A