Gene Clim_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0299 
SymbolnusA 
ID6353816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp334432 
End bp335973 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content48% 
IMG OID642667928 
Producttranscription elongation factor NusA 
Protein accessionYP_001942372 
Protein GI189345843 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000558254 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAGAA AGCAGATAAA AGCGGAAGGG CAGGACAGGC GGGCGCAGAT AGCGAGCGCT 
TTCGGGGAAA TCGAGCAGTC GAAGATCTTT CTGGATAAAC GAACGGAGAG TGCGGCTGTA
AAGATGGATA TAGCTGATCT TCTCAAGGAT ATTATTCAGA AACAGCTTCG CAAGGATTAT
GATCCGGAAG TAGAGTCAAA TATTTTTATC AATCCGGAAC GAGGCGATTT TGAGGTCTAT
ATTCTCAGAA AAATCGTTCA GGAGGTCGAT ATTCCCGCTA TTGAAATCAG TCTGGACGAG
GTGAGAAAAA TCGATGAATC TCTTGATCTC GGCGATTTCT ACGAAGAGGG CCCGATCCGT
CTCGAAGATT ATCTGACACG AAAATCTATT CAGATAATCA AACAGTCCGT ACAAAAGAAA
GTCCGCGATC TTGAACGGCT TGTTGTGTAT GAAGAGTGCC TGGAAAAAGT CGGAGAGGTT
GTTGCCGGAG AGGTTTACCA GATTCGTTCC AATGAGGTCA TCTTTACCTA TAATACCTCG
AAGGATCATC GGGTTGAGCT GGTGCTGCCG AAATCGGAGA TGATGAAAAA AGACAATCCC
CGCAGAACGC CAAGGATGAA ACTCTACGTC AAACGGATCG AACGGGAGAA AGCCAAGGTG
CGGCTTGATG ACGGAGGCGT GGTTGAAAAG GAAAAACCCG ATGGCGGCAT GAAGGTTATC
GTGTCACGAG TCGATGATCG TTTTCTCTAC AAGTTGTTTG AACACGAAGT CCCCGAAATA
CTGGACGGTC TCATTGTTAT CAAGGGTATT GCCCGCGTTC CCGGAGAACG GGCGAAAGTT
TCCGTCGAGT CGACCAGTGC ACGAATCGAT CCCGTAGGAG CGAGTGTCGG TTATCGCGGG
AAACGTATTC AGAGTATAGT CAAGGAGCTC AATAACGAGA ATATCGACGT CATCTACTAT
ACCGACGAAC CGCAGATATA CATTGCCAGA GCGCTGCAGC CGGCCAAGAT AGATCCGCTG
ACGGTTCATG CCGATATAAA AACCCGCAAG GCAAGGGTTA TGCTCAAGCC CGATCAGATC
AAGTATGCGA TCGGCAAGAA CGGCAATAAC ATCCATCTTG CAGAAAAGCT TACCGGTTAT
GAAATCGATG TCTATCGTGA TGTGATCGAC AAATCACTGG AAGATCCGAC CGATATCGAC
ATCATCGAGT TCCGTGAAGA GTTCGGCGAC GATATGCTCT ACCAGCTGCT CGATGCCGGT
TTCGATACAG CTAAAAAAGT ACTGAAGGCG GGCATCGAAG AGATCGAACA AGCCCTTCTT
GGCCCGGCAA AACCTGAGGA GGTTCTTATC TTCGGAAAAG GGCGTAAAGC TCCTTTCAAA
CCGAGAGAAC GCAAGGTAAC GGATGAGGAA AAACGGTATT GGCGAAAGAT TGCTGAGAAC
ATTTACCGGA CGGTCAAAGA GCAGTTCAGC GATTCGGATT TTCGTGACCT GATCGATGAT
GCCGGTGACC GGGAAACGGT CAGTCTGAGT GCTGATGAAT GA
 
Protein sequence
MARKQIKAEG QDRRAQIASA FGEIEQSKIF LDKRTESAAV KMDIADLLKD IIQKQLRKDY 
DPEVESNIFI NPERGDFEVY ILRKIVQEVD IPAIEISLDE VRKIDESLDL GDFYEEGPIR
LEDYLTRKSI QIIKQSVQKK VRDLERLVVY EECLEKVGEV VAGEVYQIRS NEVIFTYNTS
KDHRVELVLP KSEMMKKDNP RRTPRMKLYV KRIEREKAKV RLDDGGVVEK EKPDGGMKVI
VSRVDDRFLY KLFEHEVPEI LDGLIVIKGI ARVPGERAKV SVESTSARID PVGASVGYRG
KRIQSIVKEL NNENIDVIYY TDEPQIYIAR ALQPAKIDPL TVHADIKTRK ARVMLKPDQI
KYAIGKNGNN IHLAEKLTGY EIDVYRDVID KSLEDPTDID IIEFREEFGD DMLYQLLDAG
FDTAKKVLKA GIEEIEQALL GPAKPEEVLI FGKGRKAPFK PRERKVTDEE KRYWRKIAEN
IYRTVKEQFS DSDFRDLIDD AGDRETVSLS ADE