Gene VC0395_A0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0049 
Symbolgcp 
ID5136340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp45948 
End bp46967 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content50% 
IMG OID640531509 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001216022 
Protein GI147675246 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value4.62165e-14 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCATTA TTGGTATTGA AACCTCTTGT GACGAAACGG GTATCGCGAT TTACGATGAC 
GAAAAAGGAC TGCTGTCTCA TAAGCTTTAC AGTCAGGTAA AACTGCATGC CGATTATGGT
GGTGTGGTGC CTGAGCTGGC TTCGCGTGAT CATGTAAAAA AAACCATCCC ACTCATCAAA
GCGGCGATGG CAGAGGCAAA CGTGACGCCG CAAGATTTAG ACGGTGTGGC TTTTACCGCA
GGCCCCGGTT TGGTTGGGGC GCTCTTGGTT GGCGCTACGA TTGGGCGCAG TTTAGCGTAC
GCTTGGGATG TGCCAGCGGT GCCGGTTCAT CACATGGAAG GGCATCTTCT TGCTCCGATG
CTGGAAGAGA ATCCGCCGCC GTTTCCGTTT GTCGCTTTGC TGGTATCGGG TGGTCACACC
ATGCTGGTGG AAGTGAAAAA CATTGGTGAA TACCGCATTT TAGGTGAGTC TATCGATGAT
GCGGCTGGCG AAGCCTTTGA TAAAACGGCC AAATTGATGG GATTGGATTA TCCAGGTGGC
CCGTTATTGG CCAAGCTGGC GGAAAAAGGG ACTCCGGGAC GCTTTAAATT TCCCCGTCCT
ATGACGGACA GACCGGGGCT CGATATGAGC TTTTCCGGTT TAAAAACTTT TACTGCCAAT
ACCATTGCTG CAAATGGCGA CGATGAACAG ACCCGTGCGG ATATTGCTTA CGCCTTCCAA
GAGGCCGTGT GTGACACTTT AGTCATTAAA TGTAAACGCG CATTGGAGGA GACAGGACTT
AAGCGTGTGG TGATTGCGGG TGGTGTGAGT GCCAACAAGC AGTTGCGTGC TGATTTGGAA
AAACTCGCGA AAAAAATCGG TGGCGAAGTG TATTACCCAC GTACTGAATT TTGTACCGAT
AACGGAGCGA TGATCGCTTA TGCGGGCATG CAACGTTTGA AAAATGGTGA TGTGTGTGAA
CTTGGCTTGC AAGCTCGCCC GCGTTGGCCG ATTGATCAGT TAACGTCAAT TCAGAAATAA
 
Protein sequence
MRIIGIETSC DETGIAIYDD EKGLLSHKLY SQVKLHADYG GVVPELASRD HVKKTIPLIK 
AAMAEANVTP QDLDGVAFTA GPGLVGALLV GATIGRSLAY AWDVPAVPVH HMEGHLLAPM
LEENPPPFPF VALLVSGGHT MLVEVKNIGE YRILGESIDD AAGEAFDKTA KLMGLDYPGG
PLLAKLAEKG TPGRFKFPRP MTDRPGLDMS FSGLKTFTAN TIAANGDDEQ TRADIAYAFQ
EAVCDTLVIK CKRALEETGL KRVVIAGGVS ANKQLRADLE KLAKKIGGEV YYPRTEFCTD
NGAMIAYAGM QRLKNGDVCE LGLQARPRWP IDQLTSIQK