Gene Clim_1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1855 
Symbol 
ID6355196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2036448 
End bp2038451 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content54% 
IMG OID642669459 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_001943873 
Protein GI189347344 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.329592 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGAAA TCAGGGCGTT TTTTTCGGCA TTGAACAGAT GCGTTGTGTT TTTCGGCGTT 
CTTGCCGCTA TTGTTTTGTT TTCGTTTCCG CTTTTTTCCG CTGAAGTGCA GCAGGACGGA
ACGGTGCAGT CCGGCGCCGT TTCAGGAGGA TCGGTTCTGC TGAAGTCGCA AAATGAGGCG
CAGCAGGCGG TTTGGGAACC CTCGGGAATG GATTCGCCCC CGCCTTCGCT TGTTTTGCCG
CCCGAACGGG TTCTGGACAG CGTTCCCATT CGCTCTCCCG CAAAGGTCAG CGCGAATGCA
GAGCAGAGTG AAACGGATGG AAAAGGGCGT TCTGCAAGGG TTTCGAGGAA AAAACAGGTT
CGTGACAGCT TGAGGGTTGC TGCTCTGAAG CGGGCAGAGG TTCTTAAGCA AATGAAGAGA
GCTGCTGAAA AGAAAGTGCT CGACAGTCTT GAAACTGCAG CCAGGACACT GCGTATTCAG
AACAGTCTGA GGCGCGAGTC GGCAATGGTG CGGTATCGTG ACAGTGTGCG TCTTGTTTTA
GCGCATGAGC GGACCCTTGA CAGCCTGAAT GCTATCCGCA TGAAACGGAA TCGTTCATCG
GTCACAAACC GTTCGGCTCG CAAGAAGCAG TATAGCGACA GCCTTAAGAC AGCCTCAGCG
ATTGCAGGAG GTGACAGGGA GAAGCCGAAG ATGGAGCTTG TTGATAGTCA GCCGCTGCCG
GGCAGGCTTC CCGCCGATTC GGTTTTGCTT AAAAGTGGTC CGGGCAGGCC GGTTTCCGGA
CAACCGGCGT TGCAATCTCT TCAGCTTAAG CCTGTGCCGC AGCAGAAAAC GGTTTCGACG
GCTGGAAGCC GGAGCGACAG CCTTTCTGCC GCAGCGTTGC TGTATTATCG GCAGGGGAGT
TACGATCGGG CTCTTCCTGT CGCCAATCAG GCACTTTTTC TTTCCCGAAA GAGGTCGGGT
TTGAATTCGT CAACTGAATT ACCGTCGCTT GTTCTGATTG CCGATATCTA TCTGGCGGAA
AAAAAGTACC GGCAGGCGAT GCCTTTCTAT CTGCGGGCGC TTGCTATCAG CGAAAAAATG
CCTGCGCAGG ATTATTCGGT TACAGCCGGT ATTCTGTACA GTCTCGGTTT GCTTCATGCC
GCCGATGGCT CGGAGTCCAG GGCGGATGAG TATTACCGGA AAGCTCTTGC GCTCAGGGAA
AAAACCGAGG GTCCGGAGGG CGAAGGCGTT GCGGAAACGC TTGCTGCCAT GGGAAATCTG
TACAATCGTC AGGGGAAGAG CGATATCGCC ATGATGTATT ACGTCCGGGC ATTGTCGATC
CGCGAAAAGC TCGATGGTAC GGGGGCTTCA TCCGCTCCGA TTCTTCTTAA TATGGCGGCT
TTGTACAATC AGACCGGATA TTACGATATG GCGGTTCAGC TTTTTCAGCG TGCGCTCATG
ATCAATGAGA GAGTCAGGGG TCAGTTTCAT CCCGATGTTG CCGTTTCGCT CAACGGACTT
GCCATGATTT CTCTGGTGCA GCAGCGATAT ACGGAGGCGG AACTGCTTTT TCAGCGCGGG
CTCGATGTTC AGGAGAGGGC ATTTGGTCCC GATCATGCCG AGGTTGCCCT TACTCTTCAG
AGTCTCGCTT CGGTCAAAAG GCTGTTGCAG CGGTTTGACG ATGCGGAGCG GCTGATGAAA
CGATCTCTGG CCATAACGGA AAAGCATTTT CCGCCGGGAC ACCGCAATAC CGGAGCGGCA
TTGAATTCGC TTGCCCTTAT TTATGAAGCG AAGGGCGATT ATGCTGCGGC AGAGGCTTTG
TTCAGAAAAT CTCTTGCCGT GTCGGAAAAG CGTGTCGGGG GCAATCGCTT TGATGCAGCT
CAGGTGCTCG AAAATATGTC CGGCATGTAT CTGAAATCAG GGAGGCAGAA GGAGGCTGAA
GAGTATGCGA AGAGAGCGAT GCGGCTGCGG AACCTGCCGG GGGAGAGGGG AGATGAAGGA
ACGCCGGTTT TCAGAAAGAA ATAA
 
Protein sequence
MHEIRAFFSA LNRCVVFFGV LAAIVLFSFP LFSAEVQQDG TVQSGAVSGG SVLLKSQNEA 
QQAVWEPSGM DSPPPSLVLP PERVLDSVPI RSPAKVSANA EQSETDGKGR SARVSRKKQV
RDSLRVAALK RAEVLKQMKR AAEKKVLDSL ETAARTLRIQ NSLRRESAMV RYRDSVRLVL
AHERTLDSLN AIRMKRNRSS VTNRSARKKQ YSDSLKTASA IAGGDREKPK MELVDSQPLP
GRLPADSVLL KSGPGRPVSG QPALQSLQLK PVPQQKTVST AGSRSDSLSA AALLYYRQGS
YDRALPVANQ ALFLSRKRSG LNSSTELPSL VLIADIYLAE KKYRQAMPFY LRALAISEKM
PAQDYSVTAG ILYSLGLLHA ADGSESRADE YYRKALALRE KTEGPEGEGV AETLAAMGNL
YNRQGKSDIA MMYYVRALSI REKLDGTGAS SAPILLNMAA LYNQTGYYDM AVQLFQRALM
INERVRGQFH PDVAVSLNGL AMISLVQQRY TEAELLFQRG LDVQERAFGP DHAEVALTLQ
SLASVKRLLQ RFDDAERLMK RSLAITEKHF PPGHRNTGAA LNSLALIYEA KGDYAAAEAL
FRKSLAVSEK RVGGNRFDAA QVLENMSGMY LKSGRQKEAE EYAKRAMRLR NLPGERGDEG
TPVFRKK