Gene Clim_2361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2361 
Symbol 
ID6355707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2590311 
End bp2591663 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content49% 
IMG OID642669953 
Productpentapeptide repeat protein 
Protein accessionYP_001944363 
Protein GI189347834 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATA AACAGTTACT TTTCCGGGCG ATCCTCCTTT CTGCTTTTTG TATTTCCCCA 
TCAACAGTGT TCGGTTTTGA TCCCGAGGCT GTTACAATAC TGAAAAACAG CCGTGAAGAG
TGGCAGGCCT TGCGGCGCTC AAATCCCGAA AAAACCATAG ATCTGAACAA GGCAAAGCTC
GAAGATGCAG ATCTCGAAGG AGCCAACCTC AGCAACGTAT CACTGGTAAG AGCCGAGCTT
AGTGGTGCAA ATCTCAACAG AGCAAATCTG CAGAAAACAA ATCTTGCCAT GGCATTCATC
AAAAAAGCAG ACCTGAAGGA AGCCAACTTC AGTGGCGCAT CACTGACGAA AGCCAATCTC
AAGGAATCCT TCATGAAAGG AGCTTCGTTT TCCAGGGCAA ATCTGCAGGG TGCAAACCTG
AGATGGTCGA TGCTTGAAAA TGCCGACCTA TCACAGGCAA ATCTTTCCGG AACCGTTCTT
TTCGAAGCAA ATCTTGAAAA TGCCAATCTG AAAGGGACAA ACTTCAAAGG CTCGGTTTTC
ATCGACCAGG CAAACCTCAG CGGAGCTCTG GTATCGAACA ATACCATAAT TCCGTCCGGC
GAAAAAGCGA CTCCGTCCTG GGCATCACTT CGCAAGGCAC GATTTTTCAG GGAGCCCGAT
ACCGAACCGC CGGCCTATCT GACGCCTCCT GAACCCACCA TGCTGAACGA ACAGGCGTCA
GCTTCCGGAA GCAACCTGAA AACCAGCGGA CTTAAAGTGA AGACCACTCC GGGCGACAGC
AGAAAACAAC AGGAACTGCT CACTGAAGAT GTAGAAACAT GGAACAGCAT GAGGGAGAAA
AATCCTGAAT TGCCAATTAC AATGAAACAG GAAAAACTTG AGAATGCCGA TCTCAAGGGG
GTAAACCTTT CGCAGGCCTC AATGGCTGGA TCGGATTTTG AAGATGCCAA TCTTGACAAT
GCACTCATGA ACGGAGCGGA TCTGACCGGC TCGAATTTCC AGAAAGCCGA TATGAAAGCG
GTTAAACTTC ATGGGGCCAA ACTCCACAAA GCAAACTTCG ACCGAGCCTT TCTGAAAGGA
TCTGATCTCA GCAATGCCGA TCTGACACAG GCTAATCTCT ACGGCGCAAT CATGACCGGA
ACGAATCTGA GCGGTGCCGA TCTGACCGGA GCGTCACTTT TCGATACTGA TCTTGAGGAA
GCCGACCTGT CGGGTGCAAT TCTGAAAGAT GTCACCATGA TGGATACAAA CCTGAACAAT
GCCATCATCA CCTCTGAAAC CGTTCTTCCT TCAGGGAAAA AAGCCACTGC TGATTGGGCA
GTACAGAGAG GAGCTATTTT CCGGAAGCCT TGA
 
Protein sequence
MSNKQLLFRA ILLSAFCISP STVFGFDPEA VTILKNSREE WQALRRSNPE KTIDLNKAKL 
EDADLEGANL SNVSLVRAEL SGANLNRANL QKTNLAMAFI KKADLKEANF SGASLTKANL
KESFMKGASF SRANLQGANL RWSMLENADL SQANLSGTVL FEANLENANL KGTNFKGSVF
IDQANLSGAL VSNNTIIPSG EKATPSWASL RKARFFREPD TEPPAYLTPP EPTMLNEQAS
ASGSNLKTSG LKVKTTPGDS RKQQELLTED VETWNSMREK NPELPITMKQ EKLENADLKG
VNLSQASMAG SDFEDANLDN ALMNGADLTG SNFQKADMKA VKLHGAKLHK ANFDRAFLKG
SDLSNADLTQ ANLYGAIMTG TNLSGADLTG ASLFDTDLEE ADLSGAILKD VTMMDTNLNN
AIITSETVLP SGKKATADWA VQRGAIFRKP