Gene Clim_0931 details

Gene Information

Locus tag: Clim_0931
Symbol:
ID: 6354168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1018684
End bp: 1019955
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 45%
IMG OID: 642668558
Product: protein of unknown function DUF324
Protein accession: YP_001942989
Protein GI: 189346460
COG category:
COG ID:
TIGRFAM ID:
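
The coordinates and lengths listed for this gene are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields for Clim_0931.
start_bp = 1018684
end_bp = 1019955

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final codon being a stop.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1272 423
```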


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0413017
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCAGA CATTCAAATC ATTCGTTGAA CCCTTTCCGC ATACAATATC AGAAATTAGT 
GAGCTGCTAT CAAACTTAAG CCAAGCAGAG CAGGCTATCA AAGTGAATAA GAGAAATAAG
GAGGTGCGAC AGCAATACGA AAACCAAGCT AAAGATTTGC ACCGACAACT GCTTACGCAT
AAAGATTGCT CATTGATATA CAACTATATG GAAGCGACAG GCATAGCGGA TAAGGATACT
TTTCGTAGCA CTTGGGCAAG GGAGAAGCTA AAAGTTGATT CTGATGAGTT ACCAGATTTT
CTCAAACCGC CCCTGCTTGA TGATTTGTCC TGTTTGCCTG TCGGTTCATT TTATATTCAG
TTCAAGTTTA CCTTGCTCAA GCCGTATATC TCTCGGGACG ACAATGCGTT TTATCTTGTG
GACAATTCCA TTGTGCGGGA AAAGGTTTTT CGCTTTCCGA TGGTGCGTTC CACCGCATGG
AAAGGTTCAT TGCGCCACGC TCTGTGGCAA ATGGATGGAT ATCAGAAGGA AGACCAGCAA
GATCAGCAAA TCAAGCGCCT ATTCGGTACA GCCAATGATG AACAACCGGA GGAAGGGAAC
AGTGGCCGTT TTTATTTTTA CCCCTCTTTC TTTACCCTAA ACAGCTTGGA AGTCATCAAC
CCCCACGGCC GGAAAACGCG TGTAGGCACA ACTCCTATCC TCTTCGAATC TGTACCTATT
GGTGCCGAGG CCACTTTCAC CCTGCTATAC TCTCCCCTTG ACCGCATCGG TAGAGAAGAT
GTCGAAACAC GTCAGCAAGT TATTGCCGAC CTGAAACTGG TAGCCGAAGG GCTACGGGCG
TTGTTTATCG TATATGGTTT TGGAGCCAAG ACCAGTAGTG GGTTCGGCCT GGCTAATGAT
GCGATTGAAA ATGGTTCTCT AATTCTCAAT TTGCCCAGCT TTGTTTTCCC TCAGGCTGAA
ACCGTTCAAG TTCAAACGCC GGAAGATGTC TTTCTGAAAT ACATGGATGA ATCTGGTTAT
CTTAAAACTG TTTTTTCAGG AGGCGGTGAC TACGGGTTGA TGAGCAATAA GGAGTATGGA
GAAAAAAGAG AGCAGTTGGG TGGTGGGTAC CTTACCGAGT TCAAGGCATT TCGCAAATGG
TATGGTGAAA ACGGTGAACA ATGGCAAAAG TCAATAAAAG AAAAAAGCTC AGCTACGAAC
TACCCTCAAC TCAGGTTTAA AAGTCTCAGC CAATTAGCCG AGAGGATGCG TCAGACGGGA
GGGGAAGCAT GA
 
Protein sequence
MNQTFKSFVE PFPHTISEIS ELLSNLSQAE QAIKVNKRNK EVRQQYENQA KDLHRQLLTH 
KDCSLIYNYM EATGIADKDT FRSTWAREKL KVDSDELPDF LKPPLLDDLS CLPVGSFYIQ
FKFTLLKPYI SRDDNAFYLV DNSIVREKVF RFPMVRSTAW KGSLRHALWQ MDGYQKEDQQ
DQQIKRLFGT ANDEQPEEGN SGRFYFYPSF FTLNSLEVIN PHGRKTRVGT TPILFESVPI
GAEATFTLLY SPLDRIGRED VETRQQVIAD LKLVAEGLRA LFIVYGFGAK TSSGFGLAND
AIENGSLILN LPSFVFPQAE TVQVQTPEDV FLKYMDESGY LKTVFSGGGD YGLMSNKEYG
EKREQLGGGY LTEFKAFRKW YGENGEQWQK SIKEKSSATN YPQLRFKSLS QLAERMRQTG
GEA
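
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the code below translates the first and last lines of the gene sequence and reproduces the matching ends of the protein. It assumes that the codon-to-residue assignments of translation table 11 match the standard genetic code (the two differ in permitted start codons, which this sketch does not model). The `translate` helper and the constant names are hypothetical and are not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAACCAGA CATTCAAATC ATTCGTTGAA CCCTTTCCGC ATACAATATC AGAAATTAGT"
last_line = "GGGGAAGCAT GA"

print(translate(first_line))  # MNQTFKSFVEPFPHTISEIS
print(translate(last_line))   # GEA
```

Passing the full 1272 bp gene sequence to the same function should yield the complete 423-residue protein, since the sequence length is a multiple of three and ends in a TGA stop codon.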