Gene Acid345_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0494 
Symbol 
ID4068619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp608625 
End bp609770 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content61% 
IMG OID637982498 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_589573 
Protein GI94967525 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family
[TIGR03723] putative glycoprotease GCP 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACG CCGTCATCCT GGGAATTGAA AGCTCGTGCG ACGAGACCGC CGCGGCTGTG 
ATCCGAAACG GCGCAGAAAT CCTCTCCAGC GTAGTGTTCT CGCAGATCTA CACGCATATG
CGGTACGGCG GCGTGGTGCC GGAACTGGCC TCGCGCGAGC ACTTGAAGGC TATCGTTCCC
GTGGTGCGCC AGGCGGTGGA AGACGCTGGA CAGAGCTATG ACAAGATTGA TGCCATCGCT
GTGACACGCG GACCCGGACT GGCCGGAGCG CTGCTGGTGG GCGTGAGTTA TGCGAAGGCG
CTGTCATTCG CGCTGGATAA GCCGCTGATC GGCGTGAACC ACCTGGAAGG ACACATTCAC
GTGGTGCTGC TGGAACAGAA GCAGCAAGGC GTCGGCGAAA TTCAGTTTCC GGTGCTGGCG
CTGGTGGTGA GCGGCGGACA CACGCATCTT TACCTTGCAG AGAAGAAGGA TGCGGGATGG
ACGTATCGCG ATGTGGGACA CACGCGCGAC GATGCGGCCG GCGAGGCCTA CGACAAAGTC
GCGAAGCTGC TGGGGCTTGG ATATCCCGGG GGGCCGATTC TCGATGGCCT GGCAAAGCAT
GGCGATCCCA GGGCGGTGAG GTTTCCGTTC GCGCAGATCA AGCATCGCGA CCGCAATCCG
CAGAACCGAC ATGAGGATGA CGATGCGCGA GTGGATTTCT CGTATAGCGG TATCAAGACC
GCGGTGCTGC GCTATGTTGA AACGCACGAG ATGAAGGCGG CGATTGAAGC GCGGCGAACG
GCGTTGAAGG AAATCGAGAA GCCATCGCAG GACGATTATT TGCGGGTGTG CGATCGGCAG
ACGCTCGATC TGATTGCATC GTTTCAGCGC GCGGTGGTGA ATGATCTTGT CTCGAAGGCG
CTGCACGCGG CTGCGGAAAA CAATGCAGCA ACGCTCTTGG TGACGGGCGG AGTTGCGGCG
AATTCCGAGC TGCGTGAGAC GTTTGAACGA CGTGCCGGCG AACTTGGGTT GCCTGTGTAT
TTCCCTTCGC GACCGCTGTC TACGGACAAC GCGGCGATGA TTGCGGCGGC GGCGTATCCG
CGGTTTCTGA GCGGAGAATT TGCGGCGCCT GATCTGTCCG CGGAAGCCAA TCTTCGCCTG
CGCTAA
 
Protein sequence
MADAVILGIE SSCDETAAAV IRNGAEILSS VVFSQIYTHM RYGGVVPELA SREHLKAIVP 
VVRQAVEDAG QSYDKIDAIA VTRGPGLAGA LLVGVSYAKA LSFALDKPLI GVNHLEGHIH
VVLLEQKQQG VGEIQFPVLA LVVSGGHTHL YLAEKKDAGW TYRDVGHTRD DAAGEAYDKV
AKLLGLGYPG GPILDGLAKH GDPRAVRFPF AQIKHRDRNP QNRHEDDDAR VDFSYSGIKT
AVLRYVETHE MKAAIEARRT ALKEIEKPSQ DDYLRVCDRQ TLDLIASFQR AVVNDLVSKA
LHAAAENNAA TLLVTGGVAA NSELRETFER RAGELGLPVY FPSRPLSTDN AAMIAAAAYP
RFLSGEFAAP DLSAEANLRL R