Gene Cpha266_0368 details


Gene Information

Locus tag: Cpha266_0368
Symbol: nusA
ID: 4569346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 410366
End bp: 411922
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 44%
IMG OID: 639764966
Product: transcription elongation factor NusA
Protein accession: YP_910851
Protein GI: 119356207
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
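
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(410366, 411922)
print(length, protein_length(length))  # prints: 1557 518
```

Both results agree with the Gene Length and Protein Length fields above.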


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00190694
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAGAA AGCAGGTAAA AACGGAGGGG CAGGACAGGA AAGCGCAGAT TGCCAGTGCT 
TTCGGAGAAA TCGAGCAGTC CAAAATCTTT CTGGACAAAC GTACGGAAAG TGCGGCTGTC
AAAATGGATA TTGCAGACCT TCTTAAAGAT ATCATCCAGA AGCAGCTCAG AAAGGATTAT
GATCCTGAAG TTGAGGCTAA TATTTTCATT AATCCCGAGC GCGGCGATTT TGAGGTTTAT
ATTCTGAAAA AGGTTGTCAG TGAAGTTGAT CTCGAAAGCA TTGAAATCAG CATCGATGAA
GTCAGAAAAA TAGACGATTC TCTTGAACTC GGAGATTATT ATGAAGAGGG CCCGATCAAG
CTTGAGGATT ATCTGAGCAG AAAGTCTATC CAGATTATCA AGCAGTCTGT ACAGAAAAAG
GTACGGGATC TTGAGCGTCT TGTGGTGTAT GAGGAGTGCC TTGAAAAAGT TGGCGAGGTT
GTGGCCGGTG AAGTATATCA GGTTCGGTCA AACGAAGTGA TATTTACCTA CAACACATCA
AAAGATCATC GTGTTGAGCT TGTTCTGCCA AGAGCCGAAA TGATAAAAAA GGATAATCCT
CGCAGAAACC CGAGGATGAA GCTCTACGTC AAGCGCATCG AACGCGAGAA GGTTAAGGTA
AGGCTTGATG ACGGCGGCGT TGTCGAGCGC GATAAACCTG ATGGGGGAAT GAAAGTAATT
GTTTCAAGGA TTGACGATCG CTTTTTGTAC AAACTGTTTG AACATGAGGT TCCGGAAATT
CTCGATGGTC TTATTGTGAT AAAAGGAATT GCCCGTGTTC CAGGCGAAAG GGCAAAGGTT
GCCGTTGAGT CCACCAGTGC CAGAATAGAT CCTGTAGGAG CAAGCGTAGG GTATCGCGGA
AAAAGGATAC AGAGTATCGT CAAGGAACTG AACAATGAGA ATATCGATGT TATTTATTAT
ACCGATGAAC CGCAGATTTA TATAGCTCGT GCTTTGCAGC CAGCCAAGAT TGATCCCTTG
ACGGTTCATG CCGATATTAA AACCCGAAAG GCAAGGGTAA TGCTGAAGCC GGACCAGATC
AAGTATGCCA TTGGAAAAAA CGGTAATAAC ATACATCTGG CCGAAAAGCT TACCGGATAT
GAAATCGATG TGTATCGCGA TGTGATGGAC AAGTCGATGG AAGATCCTAC TGATATTGAT
ATTATTGAGT TCAGGGAAGA GTTTGGCGAT GATATGATCT ATCAATTGCT TGACGGTGGC
CTTGATACGG CCAAGAAGGT ACTGAAAGCA GGTGTTGAAA GGATTGAGGA GGTTTTGCTC
GGCCCTTCAG CTCCAGAGGA GATTACTTTT TTTTCAAAAG GTCGAACCAG GAGCCCGATT
AAGCCAAGAG AGCGAAAGGT AACCGAAGAG GAAAAGCGGT ACTGGAAAAA AATTGCCGAA
AATATTTTTA AAACCGTTAA AGAGCAGTTT ACTGATGCTG ATTTGCATGA TCTGTTCGAC
GATGGAGATG AAGACAGTGA CGATCAACAA GCCGCTGAAA GTCCGGCAGA TGAATGA
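
The gene sequence is listed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the grouping and compute a GC percentage. The helper names are hypothetical, and only the first line of the sequence is used as a sample, so the printed value is not expected to match the 44% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGGTCAGAA AGCAGGTAAA AACGGAGGGG CAGGACAGGA AAGCGCAGAT TGCCAGTGCT"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting all 26 lines of the gene sequence into `sample` would give a value to compare against the GC content field.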
 
Protein sequence
MVRKQVKTEG QDRKAQIASA FGEIEQSKIF LDKRTESAAV KMDIADLLKD IIQKQLRKDY 
DPEVEANIFI NPERGDFEVY ILKKVVSEVD LESIEISIDE VRKIDDSLEL GDYYEEGPIK
LEDYLSRKSI QIIKQSVQKK VRDLERLVVY EECLEKVGEV VAGEVYQVRS NEVIFTYNTS
KDHRVELVLP RAEMIKKDNP RRNPRMKLYV KRIEREKVKV RLDDGGVVER DKPDGGMKVI
VSRIDDRFLY KLFEHEVPEI LDGLIVIKGI ARVPGERAKV AVESTSARID PVGASVGYRG
KRIQSIVKEL NNENIDVIYY TDEPQIYIAR ALQPAKIDPL TVHADIKTRK ARVMLKPDQI
KYAIGKNGNN IHLAEKLTGY EIDVYRDVMD KSMEDPTDID IIEFREEFGD DMIYQLLDGG
LDTAKKVLKA GVERIEEVLL GPSAPEEITF FSKGRTRSPI KPRERKVTEE EKRYWKKIAE
NIFKTVKEQF TDADLHDLFD DGDEDSDDQQ AAESPADE
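
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, not the database's own method: it builds the codon-to-amino-acid mapping in the NCBI TCAG ordering, which translation table 11 shares with the standard code for elongation codons. It ignores the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGTCAGAA AGCAGGTAAA AACGGAGGGG"))  # prints: MVRKQVKTEG
```

The output matches the first ten residues of the protein sequence listed above.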