Gene Lcho_3542 details


Gene Information

Locus tag: Lcho_3542
Symbol:
ID: 6160914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 3959226
End bp: 3960167
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 68%
IMG OID: 641666315
Product: RNA polymerase, sigma 32 subunit, RpoH
Protein accession: YP_001792561
Protein GI: 171060212
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02392] alternative sigma factor RpoH
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
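
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3959226
end_bp = 3960167
gene_length_bp = 942
protein_length_aa = 313


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 942
print(expected_protein_length(gene_length_bp))  # 313
```

Under these assumptions both values agree with the reported gene length (942 bp) and protein length (313 aa).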


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a


Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0000381173
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage


Sequence

Gene sequence
ATGTCTACCA TGAACCTGTC TGCATCTGCT GCCTCGACCG CCGTCATCGT CCGTGATCCG 
TGGGCCCTGG TTCCCTCGCT GGGCAATCTG GACGCCTACA TCAGCGCGGT CAACCGTCTG
CCGCTGCTCA CGCACGAGGA AGAAGTGAGC TTCGCGCGGC GTCTGCGTGA CAGCCAGGAC
GTCGAAGCGG CCGGCCGGCT GGTGCTGTCG CATCTGCGCC TGGTGGTGTC GGTGGCGCGT
CAGTACCTCG GTTACGGCCT GCCGCACGGC GACCTGATCC AGGAAGGCAA CGTCGGCCTG
ATGAAGGCGG TCAAGCGTTT CGACCCCGAG CAGGGCGTGC GCCTGGTCAG CTACGCCATG
CACTGGATCA AGGCCGAGAT CCACGAGTAC GTCCTGAAGA ACTGGCGCGT GGTCAAGCTC
GCCACCACCA AGGCGCAGCG CAAGCTGTTC TTCAACCTGC GCTCGATGAA GCGCCAGCTC
AAGGGCGAAG CCGCCGACGG CGACACCCAT CGCAGCTCGC TGACCGAAGC CGAGATCGAC
ACCGTCGCGC GCGAACTCAA CGTCAAGCGC GAAGAAGTGA TCGAGATGGA GGCGCGTTTC
GCCGGCGGCG ACGTGGCGCT CGAGCCCGGC TCCGACGAGG ACGACGAGAG CTACACGCCG
ATCGCCTACT TGGCCGACGA GCGCCAGGAG CCGACCCGCG CGCTCGAGGC CGCGCACCGC
GACGAACTCG CCGGCCCCGG CCTGCTGCGC GCGCTCGACG CGCTCGACGC CCGCAGCCGC
CGCATCGTCG AGGAGCGCTG GCTGAAGGTC AACGACGACG GCTCGGGCGG CCTGACGCTG
CACGACCTGG CGGCCGAATA CGGCGTCAGC GCCGAACGCA TCCGCCAGAT CGAGGTGGCG
GCGATGAAGA AGATGCGCAA GGCGCTGGTC GAACACGCCT GA
 
Protein sequence
MSTMNLSASA ASTAVIVRDP WALVPSLGNL DAYISAVNRL PLLTHEEEVS FARRLRDSQD 
VEAAGRLVLS HLRLVVSVAR QYLGYGLPHG DLIQEGNVGL MKAVKRFDPE QGVRLVSYAM
HWIKAEIHEY VLKNWRVVKL ATTKAQRKLF FNLRSMKRQL KGEAADGDTH RSSLTEAEID
TVARELNVKR EEVIEMEARF AGGDVALEPG SDEDDESYTP IAYLADERQE PTRALEAAHR
DELAGPGLLR ALDALDARSR RIVEERWLKV NDDGSGGLTL HDLAAEYGVS AERIRQIEVA
AMKKMRKALV EHA
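
The sequences above are printed in space-separated blocks of 10 residues, so they need to be normalized before any computation such as GC content. The following is a minimal sketch of that step, assuming GC content is defined as the percentage of G and C bases over the full gene sequence. `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last rows of the gene sequence are reproduced here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group residues into blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGTCTACCA TGAACCTGTC TGCATCTGCT GCCTCGACCG CCGTCATCGT CCGTGATCCG"
last_row = "GCGATGAAGA AGATGCGCAA GGCGCTGGTC GAACACGCCT GA"

print(len(clean_sequence(first_row)))  # 60 bases per full row
print(clean_sequence(first_row)[:3])   # ATG (start codon)
print(clean_sequence(last_row)[-3:])   # TGA (stop codon)
print(gc_percent("ATGTCTACCA"))        # 40.0 for the first block alone
```

Passing the complete gene sequence to `gc_percent` would give a value to compare against the GC content reported in the Gene Information section.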