Gene Cag_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0781 
Symbol 
ID3748066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1099727 
End bp1101163 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content45% 
IMG OID637773311 
Productsigma-54 factor 
Protein accessionYP_379090 
Protein GI78188752 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATT TTACGCTTCA GCAAAAGCAA GTACAACATC TATCAGCGCA ACAAATTCTT 
GGGAGCCAGC TTTTACAGCT TCCAATGCAG CAGCTTGAAG AGCGCATTTA TCAAGAAGTG
CAAGAAAATC CCATGCTCGA ATTAGTTGAA GCGCCTCGTG ATGGGCAAAT TGATGGTGTG
GTAGCTGAGC CAAGCAATGG AGCCGTGGGG GAGATGTTTG ATTCCATTGA TCGCTTTAGC
CGAGCTTCAC TCAATAGTCG TGTTCATAGC GGTAGCCCTT CAAGCGGGCA GGGGGGAAGT
GATGATAGCA AAGAGCGCTT TTTTCAAGCG GTGCAGCACG ATAGTTTAGC AGAACTACTG
TGTCGCCAAC TTGCTTTGCA AGAGCATATT GGTGAGCGTG AAATGGCAAT TGCCGAAGAG
ATTCTCGGCA ACCTTGATAG CGATGGCTAT TTTACCGAGT CCATTGAGCT TATTGTTGCA
AGTCTTCAGC AAGCGGAGGT TATTGTTAGT AATGCTGAAG TGGAAGCGGT GCTTCATTCC
ATCCATTTTC TTGACCCAGC AGGCATTGCC GTGCGCAATG TGCAACAGCG TTTAATGGTG
CAGTTGCAAG TTGCCGCCCA TCGTTATCCA GCCGCAACAT ATAATGTAGC AATGCGCTTG
CTTGGCGATT ATTACGACGA CTTTTTAAAT CGTCGGTTAG ATATGCTCCT TAAAAAACTT
GGAGTGCCAA AAGCAGAGCT TGAAGCTGCC GTTACGGCTA TTATTGCGCT CGATCTTCAT
CCGGGTGTTT TTTACGATGA GGGTGGGCAT TACATTAGCC CCGATGTTAT TGTTACCTAC
GAAAATGGTG AATTAACAGC GGCATTAAAT GACCGAAGCG CTCTCTCGGT TAAAGTAACC
GATCGCTATC GTGAACTGCT TGCAAATCGT AAAGCGCCGA AAGAGGAGAA GCAATTTATT
CGCCACAACA TTCAGCGTGC TCAAGATTTT GCAACAGCCT TAGCCATGCG CCGCCAAACC
CTTTTAAAAG TAATGGAAGC GCTTTTAAAG CAGCAATATG CTTTTTTTGT TTCGGGACCT
GAGCATGTTG TACCACTTGG CATGAAAAGT GTTGCTGAAG AGACGGGGCT TGATATTTCC
ACCATTAGCC GTGCGGTAAA TGGCAAATAT GTACAAACTC GCTTTGGAGT CTTTGAATTG
CGTTACTTTT TTGGAAGTGC ACTCTCAACC GATGAGGGCG AAGAGCTTTC AAGCAAAATT
ATTCGTCAAC ATCTTGCCGA AATAATTAAA GCCGAAGATT CAGCCCATCC ATTAAGCGAT
GACACGCTTG CCGAAATGCT GGTGAGTAAA GGTATTCGTA TTGCTCGCCG AACGGTTGCA
AAATACCGTG AACAAATGCA AATTCCCGTT GCAAGATTAA GAAAAAAAAT ATTTTAA
 
Protein sequence
MADFTLQQKQ VQHLSAQQIL GSQLLQLPMQ QLEERIYQEV QENPMLELVE APRDGQIDGV 
VAEPSNGAVG EMFDSIDRFS RASLNSRVHS GSPSSGQGGS DDSKERFFQA VQHDSLAELL
CRQLALQEHI GEREMAIAEE ILGNLDSDGY FTESIELIVA SLQQAEVIVS NAEVEAVLHS
IHFLDPAGIA VRNVQQRLMV QLQVAAHRYP AATYNVAMRL LGDYYDDFLN RRLDMLLKKL
GVPKAELEAA VTAIIALDLH PGVFYDEGGH YISPDVIVTY ENGELTAALN DRSALSVKVT
DRYRELLANR KAPKEEKQFI RHNIQRAQDF ATALAMRRQT LLKVMEALLK QQYAFFVSGP
EHVVPLGMKS VAEETGLDIS TISRAVNGKY VQTRFGVFEL RYFFGSALST DEGEELSSKI
IRQHLAEIIK AEDSAHPLSD DTLAEMLVSK GIRIARRTVA KYREQMQIPV ARLRKKIF