Gene Cag_1824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1824 
Symbol 
ID3746455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2345498 
End bp2346481 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content40% 
IMG OID637774362 
ProductDNA-directed RNA polymerase subunit alpha 
Protein accessionYP_380118 
Protein GI78189780 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATACC AAATGCAGAT GCCTGCAAAA ATAGAGCTTG ACGAATCTTC TCATAGTGAT 
AGTTTCGGGA AGTTTATCGC CCAACCGCTT GAGCGTGGTT ATGGCGTTAC TCTTGGCAAC
TTGATGAGAA GAGTGTTGCT TGCCTCGTTA CCGGGAACTG CAATTACAGG TATCAAAATA
GAGAATGTTT ATCATGAGTT CTCTACTATT CAGGGCGTTC GCGAAGATGT TCCTGAAATT
GTGTTAAATC TTAAAAAGGT TCGCTTTCGA TCACAATGTA AGCGTAGTTG CAAAACCACG
GTAACATTGG TTGGTCCTAT GGAATTTACC GCAGGTGTTA TTCAGCCGCA AGAAGGTGAG
TTCGAGGTTC TTAATAAGGA TTTACATATT GCGACTATCA ATGCGGGTAC AACCGTTACG
CTTGATATTT TTATAGGACG TGGTCGTGGT TATGTGCCTG CTGAAGAAAA TCGTGCTGAA
GGAATGCCGC TTGGGTTTAT TCCAATTGAC TCGATTTTTA CCCCTATTCG TAATGTAAAG
TTTACGGTTG AAAATACTCG TGTGGGGCAG CGTACTGATT ATGAAAAAAT GATTCTTGAG
GTTGAGACTG ATGGTTCAAT TACTCCTGAT GATTCCATTA GTTTAGCAGG AAGAGTTATT
TCTGATCATG TTTTACTTTT TGCTGATTTC TCTCCTGCTG AAGAGGAATA CACAGAAGAA
GAGTTCAAGC AGCAAGATGA TGAGTTTGAA ACGATGCGTC GTTTGTTAGC AACAAAAATC
GAAGATCTTG ATTTATCGGT TCGCTCACAC AATTGCTTGC GTCTTGCTGA AATTGATACG
CTTGGAGAGT TAGTTTCGCA TAAGGAAGAT GAGTTGTTGA ATTACAAAAA CTTTGGTAAG
AAGTCGCTTA CCGAGCTTAA AGAGCAACTT GATAAGTTTG ATCTTAAGTT TGGTATGGAT
ATTACCCGTT ACCAAATGAA GTAA
 
Protein sequence
MIYQMQMPAK IELDESSHSD SFGKFIAQPL ERGYGVTLGN LMRRVLLASL PGTAITGIKI 
ENVYHEFSTI QGVREDVPEI VLNLKKVRFR SQCKRSCKTT VTLVGPMEFT AGVIQPQEGE
FEVLNKDLHI ATINAGTTVT LDIFIGRGRG YVPAEENRAE GMPLGFIPID SIFTPIRNVK
FTVENTRVGQ RTDYEKMILE VETDGSITPD DSISLAGRVI SDHVLLFADF SPAEEEYTEE
EFKQQDDEFE TMRRLLATKI EDLDLSVRSH NCLRLAEIDT LGELVSHKED ELLNYKNFGK
KSLTELKEQL DKFDLKFGMD ITRYQMK