Gene Information |
Locus tag | Csal_2226 |
Symbol | |
ID | 4026418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2497501 |
End bp | 2498934 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637967431 |
Product | sigma-54 RpoN |
Protein accession | YP_574276 |
Protein GI | 92114348 |
COG category | [K] Transcription |
COG ID | [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog |
TIGRFAM ID | [TIGR02395] RNA polymerase sigma-54 factor |
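The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the Csal_2226 record above.
length_bp = gene_length_bp(2497501, 2498934)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields of the record.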
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCATGA AACCTACGCT TCAGCTTCGC GTCGGCACCC AGCTGACGAT GACGCCTCAG TTGCAGCAAG CGATCCAGCT CTTGCAGCTC TCGACGCTGG ATTTGCGTCA GGAGATCCAG CAGGCGCTGG ACGCCAACCC CATGCTGGAA CTCGAGGAAC ACTTCGACGA GCGCGAAGTC AGCGAGACGC AGGACGACGA CTGGAGCAGC GAAATTCCGG AAGACCTGCC GCTGGACAGC GACTGGAGCG ATACCTACCA GGACGCCGGG CTCGGCTCAT TGAACGGCAG CGGGGGCAGC GAGGAAGGCC CCGACATCGA GCGCAACACG GCGCACACCA GCCTTCACGA TCATCTGGGC TGGCAACTGG CCATGACCGA CATGACGCCA CGCGAACATG CGATCGCCGA GAGTCTGATC GATGCCGTGG GCGCCGATGG CTACCTTACC GTGTCGCTCG AGGAGCTTCT CGATGGGCTG CGCGGTCAGG GGTTGGCCGG CCTCAAGATC GGTGACGTCG AGCAGGTCAT GATGCGCGTG CAGCAATTCG ACCCCACGGG CGTCGCCGCT CGCGATCTGC GCGAATGTCT CCTGCTGCAG CTGGGCGCCC TGCCCGAGGA CATGCCGCTG CTTCCCCAGG CCAAGCGCCT GGTGCGTCAA TTCCTCGACG CGCTCGCCGG CAACGACCGC AAGCTGCTCA AACGGCGCCT GCGTCTCGAA GAGGCGGAAC TCGACCATAT CATCGCCCTC ATCCGCACGC TCGACCCGCG CCCCGGTCTG GCCTTCGAGG ACAGCGACAA CGATTACGTC ATCCCCGACC TCATCGCCCG CCGCGTTCGT CAGGAATGGC GCATCGAGCT CAATCCCGAC GCCTTGCCGC GCGTCCGCAT CCAGCCGGAT TACGCCGCTC TGATCAAGCG TGCCGACAAG AGTGACGACA ACACCTTCCT CAAGCAGCAC CTGCAGGAAG CCAAGTGGCT GCTCAAGAGC CTGTCTAGCC GTAACGACAC CCTGTTGCGC GTGGGCCGCG AAATCATGGC CCGTCAGCTC GACTTCCTCG AGCATGGCGA AGAAGCGATG AAACCGTTGA TCCTCGCCGA TATCGCCGGC GCGGTGGACA TGCACGAATC GACCATTTCA CGCGTCACGA CGCAGAAATT CATTCATACC CCACGTGGCG TGTTCGAGCT GAAATACTTC TTCTCCAGCC ATGTGGGCGG TGGCCACGGC GAAGGCGACG CCCATTCCAG CACCGCGATA CGCGCCCGGC TGAAGAAGCT GATCGGCGAA GAGCCGCCAC GCAAGCCGTT GTCCGACAGC AAGCTCGTCG ACCTGCTGGC ACAGGACGGC ATTCAGGTCG CACGCCGCAC GGTGGCCAAG TATCGTGAAG CGATGGGCAT CCCCTCCTCC AGCGAACGCA AGCGCCTGGC CTGA
|
Protein sequence | MSMKPTLQLR VGTQLTMTPQ LQQAIQLLQL STLDLRQEIQ QALDANPMLE LEEHFDEREV SETQDDDWSS EIPEDLPLDS DWSDTYQDAG LGSLNGSGGS EEGPDIERNT AHTSLHDHLG WQLAMTDMTP REHAIAESLI DAVGADGYLT VSLEELLDGL RGQGLAGLKI GDVEQVMMRV QQFDPTGVAA RDLRECLLLQ LGALPEDMPL LPQAKRLVRQ FLDALAGNDR KLLKRRLRLE EAELDHIIAL IRTLDPRPGL AFEDSDNDYV IPDLIARRVR QEWRIELNPD ALPRVRIQPD YAALIKRADK SDDNTFLKQH LQEAKWLLKS LSSRNDTLLR VGREIMARQL DFLEHGEEAM KPLILADIAG AVDMHESTIS RVTTQKFIHT PRGVFELKYF FSSHVGGGHG EGDAHSSTAI RARLKKLIGE EPPRKPLSDS KLVDLLAQDG IQVARRTVAK YREAMGIPSS SERKRLA
|
| |
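The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch, not the database's own tooling, of how such a field could be cleaned, translated, and checked for GC content. It assumes the amino acid assignments of translation table 11 match the standard code (the two tables differ in their permitted start codons, which this sketch does not model); all function names are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over the bases T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(field: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of 10."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence field above.
prefix = clean_sequence("ATGTCCATGA AACCTACGCT TCAGCTTCGC")
print(translate(prefix))
print(gc_fraction(prefix[:10]))
```

Applied to the full gene sequence field, `translate` should reproduce the protein sequence field, and `gc_fraction` can be compared with the GC content value in the Gene Information table.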