Gene Acid345_2241 details

Gene Information

Locus tag: Acid345_2241
Symbol:
ID: 4072986
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2659967
End bp: 2661214
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 61%
IMG OID: 637984257
Product: AAA ATPase
Protein accession: YP_591316
Protein GI: 94969268
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1222] ATP-dependent 26S proteasome regulatory subunit
TIGRFAM ID:
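
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2659967
end_bp = 2661214

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1248 bp
print(protein_length, "aa")  # 415 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.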


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.245938
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.85938
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGACC TATTGAAGAA AATTCTGGAT GGCCAGAACC AGTTTGCGTC TGGCGGCTTG 
TTGCTGATGA TCCTCGGCAG CGTGGGCGTG TTTTTGCGTT CGTTGCCGTC GCAACTGTGG
AGCTGGATCA TGCGGCAGTC CACCATGTCC ATCACCGTAA AGGACGACGA CCAGGCCTTC
GCGTGGGTGA AGGAGTGGTT TCTCGAGCAA AAGTTCCTGA AGCGCGTGCG TCGTCTCGAT
CTCGATACGT CGCTGCGTGG CGCGGAAGCT GCCATGGTTC CGGCGCCGGG CCGTCACTGG
TTCATGCGCG GCGGGCGCCC GTACTGGGTG TGGTTCTGGC GAACCGAGAA CACCAAGGGC
TACAACCAGC GCCGCATGGA GTCGTTCATG ATCGAGACCA TCGGACGCGA CCAGCAGGTG
CTGCGGCAGT TTGTCGCCGA AGTGGTTGCG TGCCACAAGA AGAAGCTGCG CACCGCGTCG
TACCTGTACC TGTACGACGA CGGCTGGGAC CGCGTTGAGT CCTACTGGCC GCGACGGCTC
GACTCGGTGC TGTTGAAGCC GGGCGAGAAG GAACACCTCA TTCAAGACCT GGAGCGCTTC
CGCGCGTCGC GGGACCGCTA CCGCCGGTTG GGTGTTCCCT ACCATCGCGG CTACCTGTTC
TACGGACCTC CGGGAACCGG CAAGACGTCG TTGGTATCGG CGTTGGCCGC GCGGTTCGGG
ATGTCGGTGT ACATCGTGAA CCTCTCGGAA CTGAACGACC GTACGCTGAA GACCGCGATG
AACTGGGTTT CGGATAACTC GGTCATCCTC TTCGAGGACA TCGACTGCAT GAACGCCAGC
ACCCGGCGTT CACAAGCAGG CGGCGCACCG CGAAGTGAGA CCGCAGACGA TCCGAAGGAG
AAGAGCGCGA TCGACAAGAT GGGCGTGAGC TTATCGGGTT TGTTGAACGT GCTCGATGGC
TTCTCGGCGC CGGAAAACGT GGTGTACGCG ATGACCACCA ACGACATCAG CGGACTCGAC
GCGGCGTTGC TGCGTCCGGG CCGCATTGAT TACAAGCTCT ACCTCGGCGA GGCCTGCGAG
TCGCAGAAGG TGGAGTTGTA CCGCCGCTTC TTCCCTGAGT CGTCGGAAGA GGAAGCTCGC
GCCTTCGCAC AAGCGAACTG GGCCGAGACC ATGGCGGAGT TCCAGGGACT GCTTCTGGCA
TTGGAGCAGG AAGTGGGAAC GACGGAAGTC GGAGTGGTTC AGTCGTGA
 
Protein sequence
MFDLLKKILD GQNQFASGGL LLMILGSVGV FLRSLPSQLW SWIMRQSTMS ITVKDDDQAF 
AWVKEWFLEQ KFLKRVRRLD LDTSLRGAEA AMVPAPGRHW FMRGGRPYWV WFWRTENTKG
YNQRRMESFM IETIGRDQQV LRQFVAEVVA CHKKKLRTAS YLYLYDDGWD RVESYWPRRL
DSVLLKPGEK EHLIQDLERF RASRDRYRRL GVPYHRGYLF YGPPGTGKTS LVSALAARFG
MSVYIVNLSE LNDRTLKTAM NWVSDNSVIL FEDIDCMNAS TRRSQAGGAP RSETADDPKE
KSAIDKMGVS LSGLLNVLDG FSAPENVVYA MTTNDISGLD AALLRPGRID YKLYLGEACE
SQKVELYRRF FPESSEEEAR AFAQANWAET MAEFQGLLLA LEQEVGTTEV GVVQS
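
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed in plain Python. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers, not part of any database tool. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; alternative start codons are not handled, which is acceptable here only because this gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Assumption: table 11 uses the same amino-acid assignments as the standard
# code for internal codons; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGTTCGACC TATTGAAGAA AATTCTGGAT"
print(translate(first_blocks))  # MFDLLKKILD
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` can be compared against the GC content field. Both checks are left to the reader rather than asserted here.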