Gene Cphamn1_0401 details


Gene Information

Locus tag: Cphamn1_0401
Symbol: nusA
ID: 6374063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 424695
End bp: 426260
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 46%
IMG OID: 642682918
Product: transcription elongation factor NusA
Protein accession: YP_001958847
Protein GI: 189499377
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
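
The start and end coordinates, gene length, and protein length above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(424695, 426260)
print(length, protein_length(length))
```

With the values in this record, both results agree with the listed gene length (1566 bp) and protein length (521 aa).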


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000166
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.293714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGAA AGCAGGTAAA AAAAGAAAAG CAGGATCGTA GAGAGCTTAT CGCAAATGCT 
TTTGGTGAAA TTGAGCAGTC GAAGGTTTTT CTGGAAAAAC ATACGGAGAG CGCCGCAGTG
AAGATGGATA TTGCGGATCT TCTGAAGGAT ATTATACAGA AGCAGTTGCG CAAGGACTAC
GACCCCGAGG TTGAGGCTAA TATTTTTATC AATCCGGAGC GGGGCGACTT CGAGGTGTAT
ATTCTGAAAA CAGTTGTTGA TGAGGTTGAT CTTCCTTCCA TTGAGATCGG ACTTGAGGAG
GTTAGCAGGA TCGACGAGTC TCTTGAGCTT GGGGACATGT ATGAAGAGGG TCCTGTCAAC
CTCGAGGATT ATCTGACCAG AAAGTCGATA CAGATAATCA AGCAGTCGGT TCAGAAAAAG
GTGCGGGACA TGGAAAAGCA GGTGGTGTAT GAGGATTGTC TCGAGAAAGT TGGTGAAGTT
GTCGCAGGGG AGGTCTATCA GGTCAGGCAG AACGAGGTTA TTTTTTCCTA CAACACCTCA
AAGGATCACC GTGTCGAACT GGTACTTCCG AAATCCGAAA TGATGAAAAA GGACAATCCA
AGGCGCACCC CGAGGATGAA GCTCTATGTG AAAAGGATAG AGAGAGAAAA AGTGAAAGTC
AGACAGGATG ATGGCTCCAT TGTCGAGAAA GAGAAGCCTG ATGGCGGGAT GAAGGTTATT
GTTTCCAGAA TCGACGACCG GTTTCTCTAC AAGCTTTTTG AAAGTGAAGT GCCTGAAATT
CTGGATGGGC TTATTGTCAT CAAGGGGATT GCAAGAGTTC CGGGTGAACG GGCCAAGGTT
GCTGTTGAGT CGACAAGTTC GCGGATAGAT CCTGTAGGGG CGAGCGTGGG ATACCGGGGA
AAGCGTATTC AGAGTATAGT CAAGGAACTC AACAATGAGA ATATCGATGT TATCAATTTT
ACCGACGATC CACAGATCTA TATCGCCCGG GCACTTCAGC CGGCGAAAAT CGATCCTATG
ACAGTTCATG CGGATATGAA GACGCACAAA GCCAGAGTGA TGCTCAAACC TGAGCAGATC
AAGTACGCCA TAGGTAAAAA CGGTAACAAT ATTCATTTGG CCGAGCGTCT TACCGGTTAT
GATGTGGATG TCTACAGGGA CGTTATCGAT AAATCCATGG AGGATCCTAA CGATATCGAT
ATTATAGAGT TCCGCGAAGA ATTCGGCGAT GATATGATCT ACCAGCTTCT CGACAGCGGG
CTTGATACAG CCAAAAAAGT GCTCAAGGCT GAAATAGAAG ATATTGAAGC TGCTCTGGTA
GGGCCGCCTT CTAAAAGCGA GGAATCAGCG TTCTTCACCA AAGGAAGAAA AGCTCCGTTC
AAACCCAAAG AGAGAACGCT CAGCGAAGAT GAGAAACGGT ATTGGAAAAA GATCGCTGAA
AATATTTACA AGACAGTGAA GGAGCAGTTT AACGAGGCGG ATCTGCAGGA GATAATAGAT
GAAGAGGACG AAGATATACT GAGTGATGGC GATGCAGATG TGAGCGTTGA TGATCAGAAC
AACTGA
 
Protein sequence
MARKQVKKEK QDRRELIANA FGEIEQSKVF LEKHTESAAV KMDIADLLKD IIQKQLRKDY 
DPEVEANIFI NPERGDFEVY ILKTVVDEVD LPSIEIGLEE VSRIDESLEL GDMYEEGPVN
LEDYLTRKSI QIIKQSVQKK VRDMEKQVVY EDCLEKVGEV VAGEVYQVRQ NEVIFSYNTS
KDHRVELVLP KSEMMKKDNP RRTPRMKLYV KRIEREKVKV RQDDGSIVEK EKPDGGMKVI
VSRIDDRFLY KLFESEVPEI LDGLIVIKGI ARVPGERAKV AVESTSSRID PVGASVGYRG
KRIQSIVKEL NNENIDVINF TDDPQIYIAR ALQPAKIDPM TVHADMKTHK ARVMLKPEQI
KYAIGKNGNN IHLAERLTGY DVDVYRDVID KSMEDPNDID IIEFREEFGD DMIYQLLDSG
LDTAKKVLKA EIEDIEAALV GPPSKSEESA FFTKGRKAPF KPKERTLSED EKRYWKKIAE
NIYKTVKEQF NEADLQEIID EEDEDILSDG DADVSVDDQN N
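
The gene and protein sequences above can be related to each other and to the listed GC content with a short script. This is a minimal sketch, not the method used to produce the record: the helper names are hypothetical, the input is assumed to contain only A, C, G, and T plus whitespace, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAGAA AGCAGGTAAA AAAAGAAAAG"))
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content listed in the record.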