Gene Csal_0446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0446 
Symbol 
ID4027020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp491627 
End bp492628 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content59% 
IMG OID637965604 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_572507 
Protein GI92112579 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.384145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGTT CAGTGACAGA GTTTCTCCGT CCTCGCGACA TCAAGGTCGA AGAGATCAAC 
GCGAATCATG CGAAGATCGT GCTCGAGCCG TTCGAGCGTG GTTTCGGCCA TACCCTGGGG
AATGCTCTGC GTCGCATCCT GCTGTCTTCC ATGCCCGGTT GCGCCGTTGT GGAAGCGGAG
ATTGAGGGCG TTCTGCACGA GTACAGCGCC ATCGAGGGCG TCCAAGAGGA CGTCATCGAG
ATTCTCCTGA ACCTCAAGGA CGTTGCCGTC AAGATGCACG GTAACCGTGA CGAGGTGGTT
CTGGCGCTGA GCAAGCAGGG GCCGAGCGTG GTCACCGCTG GCGATATCGC CGTCGATCAT
GACGTCGAAA TCGTCAACCC GGATCACGTC ATCGCGCACC TCAACGACAG CGGCGAGCTG
AAAATGCAGC TCAAGGTGGT TCGCGGTCGT GGCTACGAGC CGGCGGATAC CCGTGCTTCC
GAGGAAGACG AATCGCGTGC GATCGGCCGC CTCCAGTTGG ATGCGACCTT CAGCCCGGTA
CGTCGTGTGT CCTACTCCGT GGAAGCCGCG CGTGTCGAGC AGCGTACCGA CCTCGATAAG
CTGATTATCG ACTTGGAAAC CGACGGCACC CTGGACCCGG AAGAAGCGAT TCGCCGCAGT
GCGACCATCC TCCAAGAGCA GCTGGCCGCG TTCGTCGACC TCGAAGCCGA TAAGGAACAG
GAAGTCGAAG AAGAAGAGGA TCAGATCGAT CCGATTCTGC TGCGCCCCGT AGACGATCTC
GAGTTGACCG TCCGCAGCGC CAACTGCCTG AAGGCCGAGA ATATCTATTA TATCGGTGAT
CTGATTCAGC GTACCGAAGT GGAGCTGTTG AAGACCCCGA ACCTCGGCAA GAAATCCTTG
AATGAAATCA AGGACGTTCT GGCAGCGCGC GGTCTTTCCC TCGGCATGCG GCTGGAAAAT
TGGCCGCCGG CGAGCCTGAA GGACGACAAG GCCTCTGCGT GA
 
Protein sequence
MQRSVTEFLR PRDIKVEEIN ANHAKIVLEP FERGFGHTLG NALRRILLSS MPGCAVVEAE 
IEGVLHEYSA IEGVQEDVIE ILLNLKDVAV KMHGNRDEVV LALSKQGPSV VTAGDIAVDH
DVEIVNPDHV IAHLNDSGEL KMQLKVVRGR GYEPADTRAS EEDESRAIGR LQLDATFSPV
RRVSYSVEAA RVEQRTDLDK LIIDLETDGT LDPEEAIRRS ATILQEQLAA FVDLEADKEQ
EVEEEEDQID PILLRPVDDL ELTVRSANCL KAENIYYIGD LIQRTEVELL KTPNLGKKSL
NEIKDVLAAR GLSLGMRLEN WPPASLKDDK ASA