Gene Acid345_3766 details

Gene Information

Locus tag: Acid345_3766
Symbol:
ID: 4071050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4449616
End bp: 4451013
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 57%
IMG OID: 637985789
Product: peptidase U62, modulator of DNA gyrase
Protein accession: YP_592840
Protein GI: 94970792
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
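
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4449616, 4451013)
print(length)                  # 1398
print(protein_length(length))  # 465
```

Both results agree with the Gene Length and Protein Length fields listed above.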


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAACCA GAGAACAAGC TCACGACATT TTTGGAGTCC TGCGGAAGAG TACTTCCTGC 
GATGAAGTAG AACTGCTGGT GAGCGGCGGA AATTTTGCGC TCACGCGCTT CGCCAACAAT
GGTGTTACGC AGAATGTCTC GGAAGAGAAC TACTCGTTCT CCGTGCGCGT GAATCTTGAT
GGCCGCACTG CGCGAGCGAC CACAAACAAA CGCGATGCGG AGAGCCTCAA ACGCGTTGTG
GAACAGGCAG AGGCCCTCGC TCGAGTGCAG CAGCAAGATC CCGACCTGCT GCCGATGCCG
ACGCCGAGGG AAGTCGCTTC CCATTCGGAA GGGCTGCCGG ATGCTCCACC ATCGCGCTAC
TTCGAAGCGA CCGCCGCAAT GACCGCCGGG GACCGCGCGC AGCAAGTTGA GCGGATGGTG
ACTGCCGGCC GAAAACATGG GCTGACCGCT GCCGGAACTT ATTCTTCGGG CGCGCACATG
GAAGGCGTCT TCAACTCACG CGGTGTGACG CAGTGGCACG AACAGAGCAG CGCCGAAGCG
TCCATCACCA TGCTCGGAGA AAATTCCTCA GGATGGCAAA AAGCGAACTC GACGAACGCC
GACGACTTCG ATGCCGCGAA GCTCGCTGAG ATCGCTGCCG AGAAGGCCAG CCTTTCCGCT
GATCCCAAGG AACTCCCCGC CGGGAAATAC ACGGTCGTCC TGGAGCCATC TGCTGTTCTC
GACATTGTCG GCTTCATGTT TTGGGATTTC AGCGGACTCG CACTGCTCGA CCAACGTTCG
TTCCTCAACG ATCGAGTGGG AAAGCAAATA TTCGGTGACG AACTGAATAT CAGCGACGAT
GTTTATCATC CGCTGCAATC TGGATTCGCT TTCGATGGCG AAGGCATGCG CCGGAGTCGC
GTCAATTTGG TGGAAGGTGG AGTGCCGAAA CGCGTTACCT TCGCCCGTGC CACAGCGGCG
CTGATGGCGA AATCGGAGCT TGGCCGAAAG GTCGGGTCGG TGCAGGCGAC TGGACACGGA
TTCGCATTGC CCAACGAAAT GGGCGAGGCC CCGATGAACA TCGTGTTTGC ACCGCCGAAA
GACCCTAAAA CTGTTGACCA AATGGTTAAC AATGTCGAGC ACGGGATCCT GGTGACGAGG
CTCTGGTATA TCCGGGAAGT AGAGCCCTAC GAGAAGATTC TGACCGGTAT GACGCGTGAC
GGTACCTTCC TGATTGAGAA CGGAAAGGTA AAGCACGGGG TGCGAAATTT CCGTTTCAAC
CAGGGGTTGA TCGCGATGCT GTCGAATATC GAGGCGATGA GCATGCCTGT CCGGGCCAGC
GGCGAGGAGA GTATTGATAT GGTGGTTCCG GCGATGACGG TACGGGAGTT CAACTTCACT
GAAGTAACCA AGTTCTAG
 
Protein sequence
MLTREQAHDI FGVLRKSTSC DEVELLVSGG NFALTRFANN GVTQNVSEEN YSFSVRVNLD 
GRTARATTNK RDAESLKRVV EQAEALARVQ QQDPDLLPMP TPREVASHSE GLPDAPPSRY
FEATAAMTAG DRAQQVERMV TAGRKHGLTA AGTYSSGAHM EGVFNSRGVT QWHEQSSAEA
SITMLGENSS GWQKANSTNA DDFDAAKLAE IAAEKASLSA DPKELPAGKY TVVLEPSAVL
DIVGFMFWDF SGLALLDQRS FLNDRVGKQI FGDELNISDD VYHPLQSGFA FDGEGMRRSR
VNLVEGGVPK RVTFARATAA LMAKSELGRK VGSVQATGHG FALPNEMGEA PMNIVFAPPK
DPKTVDQMVN NVEHGILVTR LWYIREVEPY EKILTGMTRD GTFLIENGKV KHGVRNFRFN
QGLIAMLSNI EAMSMPVRAS GEESIDMVVP AMTVREFNFT EVTKF
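
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal Python sketch, not the database's own tooling. It assumes the sequence is pasted with the space-separated 10-base groups shown above, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTAACCA GAGAACAAGC TCACGACATT"))  # MLTREQAHDI
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared against the protein sequence and the GC content field in this record.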