Gene Avi_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvi_4144 
Symbolgcp 
ID7386920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAgrobacterium vitis S4 
KingdomBacteria 
Replicon accessionNC_011989 
Strand
Start bp3492042 
End bp3493139 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content64% 
IMG OID643652838 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_002551011 
Protein GI222150054 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGGTC CTTTGAAGAT TCTCGGCATC GAAACCAGCT GCGATGAAAC CGCAGCCGCT 
ATCGTTCTGC GCCATGATGA TGGGCGCGGC GAAATCGTGT CCGATGTGGT GTTGAGCCAG
CTGGACGAAC ACAGCGTCTA TGGCGGCGTC GTGCCGGAAA TCGCCGCCCG CGCCCATGTC
GAGGCGCTGG ATACGCTGGT GGAAGAGGCC CTGGCCAAGG CCGATACCCG GCTTTGCGAT
ATCGATGCCG TGGCGGCCAC CGCCGGTCCC GGCCTGATCG GCGGGTTGAT CGTCGGGTTG
ATGACCGGCA AGGCGATTGC CCGGGCGGCG GGAAAGCCGC TTTTTGCTAT CAACCATCTG
GAAGGCCATG CGCTGACGGC GCGGCTGACG GACAATGTTG CCTTTCCCTA TCTGATGCTG
CTGGTTTCAG GTGGTCATAC CCAATTGGTT CTGGTGCGCG GCGTCGGCGA TTACCAGCGC
TGGGGCACCA CCATCGACGA TGCGCTCGGA GAGGCGTTTG ACAAGACCGC CAAGCTGCTC
GGCCTTCCCT ATCCGGGTGG CCCGGCGGTG GAGCGCGCCG CCCTTCATGG CAACGAAAAA
CGCTTCAATT TTCCGCGACC GCTGGTGGGC GAGGCGCGGC TGGATTTTTC CTTTTCTGGA
TTGAAGACCG CGGTGCGGCA GGCGGCACAG GCGGCAGCGC CGGTTAGCCA AGCGGATATC
GCCGATATCT GCGCCTCTTT CCAGCGCGCC ATCGCCCGCA CCATGGACGA CCGGATCGGC
CGGGGGCTGG AGCGGTTCAA TACGGAATAT CCGGGCCTTG AGGCCAAGCC AGCCCTGGTG
GTGGCTGGCG GCGTCGCCGC CAATCAGGCG CTGCGTGCTG CCTTGCAGAC GCTTTGCGAC
CGGCATGGGT TTCGCTTTAT CGCGCCACCG CATCATCTTT GCACCGACAA TGCGGCAATG
ATCGCCTGGG CCGGGCTAGA GCGGCTGGCC CATGGCTTTC CGGCTGATGA CCTGTCGGTC
TCGCCACGCG CCCGCTGGCC GCTGGACGCC AATGCGGCGA CCCTGCTTGG CTCCGGCAAG
CGAGGCGCAA AAGCATGA
 
Protein sequence
MTGPLKILGI ETSCDETAAA IVLRHDDGRG EIVSDVVLSQ LDEHSVYGGV VPEIAARAHV 
EALDTLVEEA LAKADTRLCD IDAVAATAGP GLIGGLIVGL MTGKAIARAA GKPLFAINHL
EGHALTARLT DNVAFPYLML LVSGGHTQLV LVRGVGDYQR WGTTIDDALG EAFDKTAKLL
GLPYPGGPAV ERAALHGNEK RFNFPRPLVG EARLDFSFSG LKTAVRQAAQ AAAPVSQADI
ADICASFQRA IARTMDDRIG RGLERFNTEY PGLEAKPALV VAGGVAANQA LRAALQTLCD
RHGFRFIAPP HHLCTDNAAM IAWAGLERLA HGFPADDLSV SPRARWPLDA NAATLLGSGK
RGAKA