Gene Daro_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4149 
Symbol 
ID3566649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4448008 
End bp4449477 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content58% 
IMG OID637682621 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_287345 
Protein GI71909758 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCCG CTCTCCAGCT AAAACTATCC CAGCATCTGG CGCTGACTCC TCAGCTCCAG 
CAGTCGATCA AGCTACTTCA GTTGTCGACG GTGGAAATGC AACAGGAAAT CGAGCGTTAT
CTGCTCGAAA ATCCCATGCT GGAACGCGAA GACGAGCATG GCGGGGATAA TTTCTCAGCC
GCCCAGCAGT TCGACGCGCC CCAAAGCAAC GAAGGTGAGC GCGAGCAAAA GGTTGAGCGG
GAAGAACGGG ATGCCCGCGA GGAACGCGAG CAGGATGTGC CATCGGCTCC GTCGGAGGTG
GACGATGACC GCTGGGCCTC CGATGCCGGG ACCTTTACCG GCGCCGGTCG CGATGAAGAC
GACGATAGCG ATTCCCGCGA CATCCATGCG GCCAACGTCA GCCTGCGTGA CCATCTGGGC
TGGCAACTGG GCATGACCCA GCTATCGGAG CGCGACCGCA GTCTGGTGCG TTTCCTGATC
GAAGCACTCG ATGATGATGG CTATCTCTCG GCGCCGCTGG TCGAGTTGTG GGAAACCCTG
CCGCCGGAAT ACGAAATTGA AATCGAAGAG CTGGAAATTG CGCTGCGCCA TATCCAGAAT
TTCGACCCGA TCGGCATTGG TGCCCGCAGT CTTCAGGAAT GCCTGCAGCT TCAGCTCAAG
GTTTTGCCGG TATGTGCCGA ACGTACACTG GCCCTGGCTA TTGTCGACAA GCATCTCGAA
CTGCTCGCCG CCCGTGATTT CGCCAAGATT CGTCGTCTGA CAGGTTGCGA TGACGAGGCT
TTGAAGGCGG CGCACAGCCT GATCACCAGC CTCAACCCAC GGCCAGCTGC CGGGTATGCC
CAGATTGAAG CGCGCTACAT CACGCCGGAT GTGATCGTGA AGAAGCTAAA GGGCAAGTGG
ACCGCCTACA TCAATCCGGA TGCCTACCCT CGGTTGCGCA TCAACCGTCT GTATGCCGAA
ATTCTCGCCA AGCAGCGGCG GGGCAACGGC AATCTATCTA CGCAGTTGCA GGAGGCGCGC
TGGCTGATCA AGAATGTCCA GCAGCGCTTC GAAACCATAC ACCGTGTCAC GCAGACCATT
GTCGATCGCC AGCGCCAGTT CTTCGAACAC GGCGAAGTGG CCATGCGGCC GCTGGTGCTG
CGTGAAATCG CCGATATTCT CGGCCTGCAC GAGTCGACTG TATCGCGGGT GACCAGCCAG
AAATACATGG CGACACCGCG CGGTATCTTC GAACTGAAAT ACTTCTTTGG CAGCCATGTC
TCCACCGATA GCGGTGGTGC TTGTTCTGCT ACGGCCATCC GCGCCCTGAT CAAGCAATTG
ATTGCGGCTG AGGATGGCAA GAAGCCCTTG TCGGATAGCC AATTATCCGA AATACTAGGC
CAGCAGGGAA TCGTAGTTGC CCGACGGACG GTTGCCAAAT ACCGCGAGTC GCTCAACATC
CCCCCGGTCA ATTTACGCAA GACCCTGTAG
 
Protein sequence
MKPALQLKLS QHLALTPQLQ QSIKLLQLST VEMQQEIERY LLENPMLERE DEHGGDNFSA 
AQQFDAPQSN EGEREQKVER EERDAREERE QDVPSAPSEV DDDRWASDAG TFTGAGRDED
DDSDSRDIHA ANVSLRDHLG WQLGMTQLSE RDRSLVRFLI EALDDDGYLS APLVELWETL
PPEYEIEIEE LEIALRHIQN FDPIGIGARS LQECLQLQLK VLPVCAERTL ALAIVDKHLE
LLAARDFAKI RRLTGCDDEA LKAAHSLITS LNPRPAAGYA QIEARYITPD VIVKKLKGKW
TAYINPDAYP RLRINRLYAE ILAKQRRGNG NLSTQLQEAR WLIKNVQQRF ETIHRVTQTI
VDRQRQFFEH GEVAMRPLVL REIADILGLH ESTVSRVTSQ KYMATPRGIF ELKYFFGSHV
STDSGGACSA TAIRALIKQL IAAEDGKKPL SDSQLSEILG QQGIVVARRT VAKYRESLNI
PPVNLRKTL