Gene Clim_1868 details

Gene Information

Locus tag: Clim_1868
Symbol:
ID: 6355209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2053636
End bp: 2054931
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 47%
IMG OID: 642669469
Product: hypothetical protein
Protein accession: YP_001943883
Protein GI: 189347354
COG category:
COG ID:
TIGRFAM ID:
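
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical, chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 2053636
END_BP = 2054931


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1296
print(protein_length(length_bp))  # 431
```

Under these assumptions the computed values agree with the reported gene length of 1296 bp and protein length of 431 aa.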


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000489897
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTTTT CGCATGGAAA TCCTTATCGG GGTGAACATG GGTTGCCGGA TATCCTGTAT 
TTGTCACCGG ATGATTTCAC CGTCACTATT TCGCGGATGA ATCTGAATGC CGTCGATCAT
GTGGCGAAAT GGACCATCCT GATTTATATG GCTGCCGATT GTGATCTTGC TGCGTTTATG
TTTGATGACC TGATGGAAAT GAAGGTTGTC GGTTCGAATG AAGATGTAAA TATATGCGTG
TTTTTCGATG GTCCTTTATT GACCGATACC TTTTTTGCAA GACTGTGCCA TGGGACGAGC
CTTGAGGAGG ATATTATTCA ACGGTTTACC GATGTGCCAA GTTCGAATGT CGGTATTCTT
AAAGAGATTA TTCTCAATAC TGCCGTACTT TTTCCTGCTG AAAGGAGAGT GCTTGTTTTA
GCTGGTCACG GTTTGGGTTG GCGAGGAGCT TTGCGGGATG ATTCGACGTG GAAGCGTTTC
AAGGAACGAA GGGCAATCGT TATGCCGTCA GGAGATTCCT CCGTTTTTTT CCGTCAGCTC
GATGAGCAGA GACAAAGAGC GCTCGAGGAG TTGAAAGCCC GTCTGAATCC CCGGGACGAG
CATCATGGAT CGGCATTTGA TATTATTGCC ATGGATGCCT GTAACATGGG TAATCTGGAG
GCTTTGTCCT TTTATTCGGA TCACGCACGC ATTCTTGTGG CTTCAGAAAA CCAGGTGCCT
GCATCAGGTT ACCCTTATGA TAGAATTCTT GAGGAACTGA AGAGAAATCC TGAACAGGAG
TGTGACGCGT TTGCCCGCTA TCTTGTGAAC GAGGTGAAAC GCTATTATGT GGATTCAATA
TTGTTATGCA GTGAGAGCGA TATAACGCAG GTTGCATTTG ACAGTACCGG ATTTCCGGCA
TTGATTGCGC ATGCAGGAGA GCTTGCGCGC GTACTGTCGG AATATGTTTC TACTGAGGGC
ATTGCAACGG TCAAGGCTTG TTCCGGAGCT TCTTTATTAC CTGAGGAGGA TACGGATTAT
ATCGATTTGA GGCTTTTTGC GAAAGAACTG GTACAGGCAG GAGTTTCTGA TGCCGTAAAG
CAGAAAGCGA TGGAACTGGT GGCTTTTTTT GATGGATCGG GATTTGTTGT GGGTAGTGCA
ACTCCGGGTG GCGATGCATT GCCGAAGGGC CTTTCCATTT ATTTTCCGCC GCCGGAACGG
TTCGATAAAG GGTATCTGGA TATTCTGAGC CACGTTCCTG AAGGTATCAG GTTGTGGGCT
GGTTTTATTG GAGCGTACTA CGGGAAGAGA TTTTGA
 
Protein sequence
MIFSHGNPYR GEHGLPDILY LSPDDFTVTI SRMNLNAVDH VAKWTILIYM AADCDLAAFM 
FDDLMEMKVV GSNEDVNICV FFDGPLLTDT FFARLCHGTS LEEDIIQRFT DVPSSNVGIL
KEIILNTAVL FPAERRVLVL AGHGLGWRGA LRDDSTWKRF KERRAIVMPS GDSSVFFRQL
DEQRQRALEE LKARLNPRDE HHGSAFDIIA MDACNMGNLE ALSFYSDHAR ILVASENQVP
ASGYPYDRIL EELKRNPEQE CDAFARYLVN EVKRYYVDSI LLCSESDITQ VAFDSTGFPA
LIAHAGELAR VLSEYVSTEG IATVKACSGA SLLPEEDTDY IDLRLFAKEL VQAGVSDAVK
QKAMELVAFF DGSGFVVGSA TPGGDALPKG LSIYFPPPER FDKGYLDILS HVPEGIRLWA
GFIGAYYGKR F