Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_4144 |
Symbol | gcp |
ID | 7386920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | - |
Start bp | 3492042 |
End bp | 3493139 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643652838 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002551011 |
Protein GI | 222150054 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGGTC CTTTGAAGAT TCTCGGCATC GAAACCAGCT GCGATGAAAC CGCAGCCGCT ATCGTTCTGC GCCATGATGA TGGGCGCGGC GAAATCGTGT CCGATGTGGT GTTGAGCCAG CTGGACGAAC ACAGCGTCTA TGGCGGCGTC GTGCCGGAAA TCGCCGCCCG CGCCCATGTC GAGGCGCTGG ATACGCTGGT GGAAGAGGCC CTGGCCAAGG CCGATACCCG GCTTTGCGAT ATCGATGCCG TGGCGGCCAC CGCCGGTCCC GGCCTGATCG GCGGGTTGAT CGTCGGGTTG ATGACCGGCA AGGCGATTGC CCGGGCGGCG GGAAAGCCGC TTTTTGCTAT CAACCATCTG GAAGGCCATG CGCTGACGGC GCGGCTGACG GACAATGTTG CCTTTCCCTA TCTGATGCTG CTGGTTTCAG GTGGTCATAC CCAATTGGTT CTGGTGCGCG GCGTCGGCGA TTACCAGCGC TGGGGCACCA CCATCGACGA TGCGCTCGGA GAGGCGTTTG ACAAGACCGC CAAGCTGCTC GGCCTTCCCT ATCCGGGTGG CCCGGCGGTG GAGCGCGCCG CCCTTCATGG CAACGAAAAA CGCTTCAATT TTCCGCGACC GCTGGTGGGC GAGGCGCGGC TGGATTTTTC CTTTTCTGGA TTGAAGACCG CGGTGCGGCA GGCGGCACAG GCGGCAGCGC CGGTTAGCCA AGCGGATATC GCCGATATCT GCGCCTCTTT CCAGCGCGCC ATCGCCCGCA CCATGGACGA CCGGATCGGC CGGGGGCTGG AGCGGTTCAA TACGGAATAT CCGGGCCTTG AGGCCAAGCC AGCCCTGGTG GTGGCTGGCG GCGTCGCCGC CAATCAGGCG CTGCGTGCTG CCTTGCAGAC GCTTTGCGAC CGGCATGGGT TTCGCTTTAT CGCGCCACCG CATCATCTTT GCACCGACAA TGCGGCAATG ATCGCCTGGG CCGGGCTAGA GCGGCTGGCC CATGGCTTTC CGGCTGATGA CCTGTCGGTC TCGCCACGCG CCCGCTGGCC GCTGGACGCC AATGCGGCGA CCCTGCTTGG CTCCGGCAAG CGAGGCGCAA AAGCATGA
|
Protein sequence | MTGPLKILGI ETSCDETAAA IVLRHDDGRG EIVSDVVLSQ LDEHSVYGGV VPEIAARAHV EALDTLVEEA LAKADTRLCD IDAVAATAGP GLIGGLIVGL MTGKAIARAA GKPLFAINHL EGHALTARLT DNVAFPYLML LVSGGHTQLV LVRGVGDYQR WGTTIDDALG EAFDKTAKLL GLPYPGGPAV ERAALHGNEK RFNFPRPLVG EARLDFSFSG LKTAVRQAAQ AAAPVSQADI ADICASFQRA IARTMDDRIG RGLERFNTEY PGLEAKPALV VAGGVAANQA LRAALQTLCD RHGFRFIAPP HHLCTDNAAM IAWAGLERLA HGFPADDLSV SPRARWPLDA NAATLLGSGK RGAKA
|
| |