Gene Acid345_1829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1829 
Symbol 
ID4072890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2208871 
End bp2209839 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content59% 
IMG OID637983838 
ProductTPR repeat-containing protein 
Protein accessionYP_590904 
Protein GI94968856 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.725463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGTC GTTTCTTTAC CTTTTGTCTT CTACTTACGT TCAGCGGACT TGCGCCTGCT 
GCGTACGCCG CCGATGCTGA GGTTCAGCTC ATCCGCGCTG GTCGCATCGA TGACGCTATC
ACCCAGTTGC AGCAGAAGGT AAAAGCGAAC CCATCCGATG CAGAGGCGCT CAATCTTCTC
GCGCGGGCGT ATTACGCGAT TGAAGACTGG GACGACGCCA TTGCCGCCAA TCAGAAGGCG
TTGAAGCTTC AGCCCAACAG CAGCGAATTT CACGGCTGGA TGGGACGCGC TTACGGCGAA
AAAGCTGACA CGGTTGGAAT GATTGGCGCC GCGACGCTTG CACGAAAGAC GAAGAATGAG
TTCGAGCGCG CGGTGCAGTT GAATCCCGTC GCGAGCCAGG CACGGCTCGA TCTTTCGGAG
TATTACATCC AGGCTCCCGG CTTCATGGGT GGCGGCATGG ACAAGGCCGA ACAGCAGGCA
GACGCAACCG CCCGCTTCGA CGAAGCCGCC TCGCACCTCA TTCGCGCCCG CATGGCGCAA
CAGAAAAAAG ACCTGGATGT CGCGGAGCAG GAGTTCCGCG CGGCGATTGC GCACGCCAAG
AACCCAGCGC CTTATTACCT GAACCTTGCG TCGTTCTATC AAACACAGAA GCGTTATCCG
GAGATGGAGC AGGCGATCGT GACGGCGGTT AGTGGAGCGG ATCGCCCGTC TGCGGTGTTG
TACGACGCGG CCTCGATCCT TAACCGCTCT GGACGTAACC TGCCGGGCGC CGTGCAATAC
CTCAAGCAAT ACCTCGCTGA AGGCCGCTTC GACGAAGATG CTCCGCCCTT CCGCGCCCAC
TACCTGATGG GTCAGGTTCT GGAGAAGATG AAGAAATCCT CGGAAGCGGT AGCCGAGTAT
CGTTCTGCAC TGAGCATGGC GAGTGAGTTC AAAAAGGCGC AGGCCGCGCT GGATAAGCTG
CAAGGGTGA
 
Protein sequence
MMRRFFTFCL LLTFSGLAPA AYAADAEVQL IRAGRIDDAI TQLQQKVKAN PSDAEALNLL 
ARAYYAIEDW DDAIAANQKA LKLQPNSSEF HGWMGRAYGE KADTVGMIGA ATLARKTKNE
FERAVQLNPV ASQARLDLSE YYIQAPGFMG GGMDKAEQQA DATARFDEAA SHLIRARMAQ
QKKDLDVAEQ EFRAAIAHAK NPAPYYLNLA SFYQTQKRYP EMEQAIVTAV SGADRPSAVL
YDAASILNRS GRNLPGAVQY LKQYLAEGRF DEDAPPFRAH YLMGQVLEKM KKSSEAVAEY
RSALSMASEF KKAQAALDKL QG