Gene Clim_1116 details

Gene Information

Locus tag: Clim_1116
Symbol:
ID: 6355758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1216936
End bp: 1218390
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 50%
IMG OID: 642668733
Product: RNA polymerase, sigma 54 subunit, RpoN
Protein accession: YP_001943164
Protein GI: 189346635
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
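
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers only match under that convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1216936, 1218390)
print(length)                     # 1455
print(protein_length_aa(length))  # 484
```

Both results match the Gene Length and Protein Length fields in the record.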


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGAGA TACGGCTACA ACAGAAACAG AAAGCGATTC TGTCTGCACA GCAGATACTC 
AGCAGTCAGC TTCTCCAGCT TCCGCTTCTC AATCTCGAAC AGAGGATATA TGACGAGCTG
CAGGAAAATC CCATGCTTGA GCTTATCGCT GAAAGCAAGG ATACCGCCGG GGATATCGCT
TCAGAAGACG ATAGTTCCCC GGGTGACGAT ATGTTCGGAA CGCTGGAACG ATTCAGCAAA
AGTTCGATGA AGGAGCCTCG GCAGGTTAAC ACCTCCAGGG AAGATTCCGA GGGGCGCCTG
AACTTTACTC ACGAAAGCGC ACCGCAGGAA CGATTTTTTC AGGCGGTACA GCACGACAGT
TTCGATGAGC AGCTTCTCCG GCAGCTTGCA ATGCAGGAGG GCATTGGTGA ACGGGAGGTC
ATGATTGCCA TAGAGATTCT CGGCAACCTC GATCATGATG GCTATCTTGC AGAGGATAAC
GATGTCATTC TTGCCGGTCT GCACTCCAGC GGACTGGATG CCGATGAACA TGAGATGGAA
AAAATTCTGC GCAAGATCCA CTATCTCGAT CCTCCCGGTA TTGCCGTACG GGATCTGAGA
GAACGGCTGC TTGTACAGCT TAAACTGAGG GAAGCGTCCT CTGAACAGGA GATATACCGG
ACTGCAGTTC GTATTCTTGT GCAGTTTTAC GAAGATTTTC TGCACCGGCG ATATGACCGG
ATTCTGAAAA AACTCGATCT CCCAAAAGAT CATGTCGAAG AGGCTCTTGG GATCATTACC
TCACTTGATC CGCATCCTGT CGAGCTGTTT CACGATGAGG GAGGGCATTA CATCACCCCT
GATTTTATCG TGACCTACGA AAACGGTGAG CTTACCGCCA TGCTTAACGA CCGGAGCTCG
CTTTCGGTCA AGGTTTCGGA ACAGTATCAG GGGATACTCA AAAACCGCAA GGCGCCAAAG
GATGAAAAGC GGTTTATCCG CTACAACCTT ACGAGGGCGA ACGATTTTGC CGCAGCCATA
GCCATCAGGC GCCAGACGCT TCTGAAGGTG ATCGAATCGC TTATGAAAGC GCAATACGCA
TTTTTCGTTT CCGGTCCGGA ACATCTTGTT CCTCTTGGAA TGAAAGCCAT TGCCGGCGAT
ACCGGTCTTG ATATTTCAAC CATCAGTCGG GCGGTGAACG GCAAGTATGT ACAGACCCGC
TTCGGAGTTT TCGAACTTAA ATACTTCTTC AGCAGTTCGC TTGCTACCGA CGAAGGCGAC
GACATGTCGA GTAAAATCAT CCGGCAGTAT ATCGGTGAAA TGGTGAAAGC GGAGAATCCC
GACAAACCGC TCAGTGACGA TCTGATAACC GGCCAGCTCA AGGACAAGGG GATCAACATA
GCCCGGAGAA CGGTTGCAAA ATATCGTGAA CAAATGCAAA TTCCAGTTGC AAGGCTAAGG
AAAAAAATAT TTTAA
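
The gene sequence is laid out in space-separated blocks of 10 bases, so the spacing has to be stripped before the sequence is used. The following is a minimal sketch of that step, together with a GC-percentage calculation of the kind summarised in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCAGAGA TACGGCTACA ACAGAAACAG AAAGCGATTC TGTCTGCACA GCAGATACTC "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_percent(seq):.1f}% GC in the first 60 bases")
```

Pasting the full sequence into `clean_sequence` should give a string of 1455 bases if the record is internally consistent.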
 
Protein sequence
MAEIRLQQKQ KAILSAQQIL SSQLLQLPLL NLEQRIYDEL QENPMLELIA ESKDTAGDIA 
SEDDSSPGDD MFGTLERFSK SSMKEPRQVN TSREDSEGRL NFTHESAPQE RFFQAVQHDS
FDEQLLRQLA MQEGIGEREV MIAIEILGNL DHDGYLAEDN DVILAGLHSS GLDADEHEME
KILRKIHYLD PPGIAVRDLR ERLLVQLKLR EASSEQEIYR TAVRILVQFY EDFLHRRYDR
ILKKLDLPKD HVEEALGIIT SLDPHPVELF HDEGGHYITP DFIVTYENGE LTAMLNDRSS
LSVKVSEQYQ GILKNRKAPK DEKRFIRYNL TRANDFAAAI AIRRQTLLKV IESLMKAQYA
FFVSGPEHLV PLGMKAIAGD TGLDISTISR AVNGKYVQTR FGVFELKYFF SSSLATDEGD
DMSSKIIRQY IGEMVKAENP DKPLSDDLIT GQLKDKGINI ARRTVAKYRE QMQIPVARLR
KKIF
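
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG-ordered amino-acid string. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which start codons it permits, which does not matter here because the gene begins with ATG. The function name is illustrative, and the input is assumed to be in frame and free of ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGCAGAGA TACGGCTACA ACAGAAACAG"))  # MAEIRLQQKQ
print(translate("AAAAAAATAT TTTAA"))                  # KKIF
```

The two outputs match the first ten and last four residues of the protein sequence listed above.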