Gene Acid345_2242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2242 
Symbol 
ID4072987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2661324 
End bp2662949 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content60% 
IMG OID637984258 
ProductTPR repeat-containing protein 
Protein accessionYP_591317 
Protein GI94969269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.433133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGCA TTGCTACGCT TTCGATTTGT GCTGCGCTGG CGCTGGGCTC GGCTGCTGTC 
GCGCAGGAGG ATGTCCATAA GCACCATCAT GACGATGGCG TGGATCACAC CACGAACTTC
GGGCATGTGA ACTTTCAGAC ATCGTGCTCG CCCGCGGCGC AAACGCAATT CGAGACCGGA
GTAGCGGCGT TGCATTCGTT TGAGTACACC TCGGCAAAGA AGTTGTTTGG AGCGGCGGAA
CAGGCGGATT CAGAGTGCGC CATCGCCTAT TGGGGCGAGG CCATGACGCT CTGGCACCAG
CTTTGGGACA CTCCCAAACA GGATGTGCTC AACGAAGGTT GGGCCATGAT CCAGAAGGGC
GAAAAAGCAA AGCATACCAG CGCACGCGAG AGCGGCTATC TGAAAGCCGT TGAGGCCTAC
TACAAGCCGT CAAAACAGAC CCCGGACGAG CGCGCGACAG CGTACTCCGA CTCGATGGGC
AAACTGCACG ACAAGTATCC GGACGACGAG GAGGCTGCCG TCTTCTACGC CCTCTCGCTG
CTCGCTTCTG AGCCGCCGAC CGACACAACT CTCGCGAATC CGAAGAAAGC TGTCGCAATC
CTGAACCAGG TTCTGGCGAA GGACCCTGAC CATCCGGGCG TGACGCATTA CATCATTCAC
GCCAGCGACA ATCCGCACAT GGCGCAGGAC GCCGTCGCCG CAGCGAAGAA GTATGCGAGC
ATCGCGCCGG GCTCGCCGCA CGCGGTGCAT ATGCCATCGC ACATCTTTGC CCGCGTCGGC
TACTGGCAGG ACTCCATTAA CTCCAACCTC GCCGCCATCG CGATCGCGAA GAAAGGCAAC
GAGGTCGACT ACCAACTCCA CCCCATGGAC TTCCTGATGT ACGCCTACCT GCAAACCGGG
CAGGACGACA AGGCCCGCAC AACCGAGCAG GAAGCCGTCG GCATGGAGAA CAAGGGCTAT
GGCCGCGGCC GCGAGCCGTT CTATTACTAC GTGCAGGCGC ACTTCCCTTC CATGCTTGCG
CTCGAGCTGC GCGACTGGAA GGCCGCTGAG GCATTACAGC CCGTCGAAGG CGGGGAGCCC
GGATTCAAAG CCATCACCTA TTGGGCGCAG GCCGTCGGCG CAGGGCATTT GAAAGATGTT
GCGAAGGCTC AGGAAGCCGT AAAGAACGTG GATGCCGCCA TCGAGGCAGA AAACAAAGCG
CATCCCGAGT ATTCCCACGC CCCCGTGAAC ACTGACAAAA ACGAAGCCCA CGCCTGGCTC
GCCTACGCGC AAGGCAACAA CGACGAAGCA TTCCGTCTGC TGAAGGAAGT GATCGACTAC
CAGGACAAAG TCGGCAAGGG CGAAGTCGAA CTGCCTGCCC GCGAAATGTA TGCCGACATG
CTGCTCGAAC TCAATCGTCC GGCAGATGCG CTGGAACAAT ACAAAATTTC CCTGAAGACC
GATCCGAACC GCTTCAACGG CGTCTATGGC GCCGGCAAAG CGGCGGAGAT GGCCGGACAG
CATGAAGTCG CTGTCGGCTA CTACAAGCAG TTGGCCGAAA ACTGCAAAGA GGCCGCGCCA
GTACGTTCCG AGTTGGCGCA CGCAAGAGAA GTAGCCGGCG GAGCGACGGT GGCCGCGGGA
CAATAG
 
Protein sequence
MKRIATLSIC AALALGSAAV AQEDVHKHHH DDGVDHTTNF GHVNFQTSCS PAAQTQFETG 
VAALHSFEYT SAKKLFGAAE QADSECAIAY WGEAMTLWHQ LWDTPKQDVL NEGWAMIQKG
EKAKHTSARE SGYLKAVEAY YKPSKQTPDE RATAYSDSMG KLHDKYPDDE EAAVFYALSL
LASEPPTDTT LANPKKAVAI LNQVLAKDPD HPGVTHYIIH ASDNPHMAQD AVAAAKKYAS
IAPGSPHAVH MPSHIFARVG YWQDSINSNL AAIAIAKKGN EVDYQLHPMD FLMYAYLQTG
QDDKARTTEQ EAVGMENKGY GRGREPFYYY VQAHFPSMLA LELRDWKAAE ALQPVEGGEP
GFKAITYWAQ AVGAGHLKDV AKAQEAVKNV DAAIEAENKA HPEYSHAPVN TDKNEAHAWL
AYAQGNNDEA FRLLKEVIDY QDKVGKGEVE LPAREMYADM LLELNRPADA LEQYKISLKT
DPNRFNGVYG AGKAAEMAGQ HEVAVGYYKQ LAENCKEAAP VRSELAHARE VAGGATVAAG
Q