Gene Clim_1884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1884 
Symbol 
ID6355225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2089142 
End bp2090425 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content45% 
IMG OID642669484 
Producthypothetical protein 
Protein accessionYP_001943898 
Protein GI189347369 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGCG GTTTACGCTC CATATTTCTC ATACTGTGGT GTGGTCTGCT GCCCGCTAAG 
AACGGTAGTG CATATGCTGC ATTCAATACG TTCAGCCGAG TCAGCGACCT GAATTCACTT
CGTCAACTGA GTGAGCAGAT GGTATTTGTG CTGGGATACA GTGAACCCGG AGACGGTGGT
GGTGGATGGT TTCGCTGGGA ACCAAATATA ATGGAGGAGC CCGACGGGAG TATGCGGATT
CGCCCGCATT CATTCAAACA AGGATGCTTT GCACGGGTAA CTGATGGTGC CGGATTGAAT
GTCAAATGGT TCGGTGCTAA AGGTGACGGT AAACACAACG ATACGGAGGC CATTCAATCC
GCAATTGAAT GGGCCTCCGC TCGACAATCT TTTTTTCAGC AGTCAATTGC AGTCCGTGAC
CTGGTGCTGA TACCATCCGG TCAGTTCCTG GTAGACAGCC TAGAACTAAA AAGCGGGGTG
ATCCTGCAGG GTGCAGGCCA GTTTTCCTCC GTGATCCTTC ATACGGGAAA CTCAAGCCGT
TGCATCTACA ATGAAAAAGG CCATCACAAC CGCTGGGTTG GAATCCGGGA GCTCACCGTG
ATCGGTTCCG ACAATAAGGG TACCTATACA GAAGGTATTC ATCTGTTTGA GGCCAATTAC
AGCAGCTATA TTAACCGGAT TACTATTCGG GGCTTCACCC AAAATATTGT TCTGGAAGAT
TGCTGGACTT TCCAGCTTAC CCGTTCACAT CTGTTCAAAG CCCATCGCAA TAATCTAACC
ATCCTTAATG GTACAGCAAT GGAGATTTCT GGGAACCGGA TTGACGGTGC CGGAAAATCG
AATATTCAGA TAAGCCGAAG TAAAAGATAC AGGAACACTG GAATCCTGAT CAGAAACAAT
GCCATTCAGC AAGCTCAGGA ATACGGATTG TATTGCAGGG ACACCAACTC ATTATTGCTA
GAAGGAAACT TCTTCGAAGC TAATAACCGA AATGGAGGCT TCGCCTTTGT TTATATTGAA
GGGCCTCAAA CAAGCAAGCA TTGCCTTATA CATTCTACAT CGAATTACTT TTCAGGAGCA
AATAAATCAG CGCCAAATTC CGTCGGGATA TTTCTGAAAG GCAATGTGAA AAGCTTTTCA
TCGAATCAGG ATTACTTCTC CGGTAGTATG GGATACGGAA TATATTCAGT TGACCTGCAA
TCCAAAGAAT TTGTGATTTC AGGCACTACA TTTCATTCTA AATCCGACTT AAAATTACCT
TCAGATATTA AAATTATTAA TTAA
 
Protein sequence
MKSGLRSIFL ILWCGLLPAK NGSAYAAFNT FSRVSDLNSL RQLSEQMVFV LGYSEPGDGG 
GGWFRWEPNI MEEPDGSMRI RPHSFKQGCF ARVTDGAGLN VKWFGAKGDG KHNDTEAIQS
AIEWASARQS FFQQSIAVRD LVLIPSGQFL VDSLELKSGV ILQGAGQFSS VILHTGNSSR
CIYNEKGHHN RWVGIRELTV IGSDNKGTYT EGIHLFEANY SSYINRITIR GFTQNIVLED
CWTFQLTRSH LFKAHRNNLT ILNGTAMEIS GNRIDGAGKS NIQISRSKRY RNTGILIRNN
AIQQAQEYGL YCRDTNSLLL EGNFFEANNR NGGFAFVYIE GPQTSKHCLI HSTSNYFSGA
NKSAPNSVGI FLKGNVKSFS SNQDYFSGSM GYGIYSVDLQ SKEFVISGTT FHSKSDLKLP
SDIKIIN