Gene Paes_0366 details


Gene Information

Locus tag: Paes_0366
Symbol: nusA
ID: 6460557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 400056
End bp: 401609
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 46%
IMG OID: 642724364
Product: transcription elongation factor NusA
Protein accession: YP_002015070
Protein GI: 194333210
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
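
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(400056, 401609)
print(length_bp)                  # 1554
print(protein_length(length_bp))  # 517
```

Both values agree with the Gene Length and Protein Length fields of this record.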


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000647313
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAAA AGCAGGCAAA AGGGGAGACT CAGGATCGCA AGGCGCAGAT CGCAAGTGCT 
TTTGGTGAAA TCGAACAATC AAAGGTCTTC CTGGACAAAC GCACTGAAAG TGCCGCGGTG
AAGATGGATA TTGCTGATCT CCTGAAAGAT ATTATTCAGA AGCAGCTGCG GAAAGATTAC
GATCCGGAAG TCGAAGCCAA TATTTTTATC AATCCTGAGA GAGGGGATTT CGAGGTCTAT
ATTCTTAAGA AAATCGTCGA TGAAATCGAT ATTCCCTCGA TAGAGATCTC CATTGACGAG
GTTCGTCAAA TAGATGACTC CTTGGAAGTA GGCGATTATT ACGAAGAAGG CCCTATCAAT
CTTGAAGAGT ATCTTACCAG AAAGTCGATT CAGATTATTA AACAGTCCGT TCAGAAAAAG
GTACGTGATC TGGAACGTCA GGTCGTCTAT GAGGAGTGCC TTGAGAAGGT GGGAGAGGTT
ATAGCAGGAG AGGTTTATCA GATTCGTCCC CATGAGGTGA TTTTCAGTTA CAATACCTCA
AAGGACCACA GGGTTGAGCT TGTCCTTCCG AAATCCGAGA TGATCAAGAA GGATAATCCC
CGGCGAACCC CTCGTATGAA GCTCTATGTG AAGCGTATTG AGAGGGAACG GGTAAAGGTA
AAGCAGGATG ACGGAAGCAT CGTTGAGAAG GAAAAACCTG ACGGAGGCAT GAAGGTCATC
GTTTCGCGTA CCGATGACCG TTTTCTCTAT AAACTTTTTG AAAGCGAAGT TCCGGAGATT
CTTGACGGAT TGATTGTTAT CAAGGGGATC GCCAGAGTAC CGGGAGAAAG GGCCAAAGTT
GCTGTTGAGT CAACCAGTTC CAGGATTGAT CCTGTTGGGG CAAGTGTCGG TTATCGTGGC
AAGAGGATTC AGAGCATTGT CAAGGAGCTC AATAATGAGA ATATTGACGT AATTTATTAC
ACTGATGAGC CTCAGATTTT CATAGCACGT GCACTGCAGC CGGCAAAGAT AGATCCTATG
ACCGTTCATG CGGATATGAA AACCCGTATG GCAAGGGTTA TGCTGAAGCC TGATCAGATC
AAGTACGCGA TTGGAAAAAA CGGCAATAAC ATTCATCTGG CAGAGCGACT GACCGGCTAT
GAGATCGATG TCTATCGTGA CGTGATTGAT AAAACACTCG AAGATCCAAA TGATATCGAT
ATTATTGAGT TCAGAGAGGA ATTCGGTGAC GATATGATCT ACCAGCTGCT GGACAGCGGG
CTCGATACGG CAAAGAAGGT GCTTCAGGCA GGCATAGAAG AGATTGAACA GGCACTGCTC
GGGCCTACGA AGATTGATGA GATGTCTGTC TTTGGCAAAG GCAGAAAAAC AGTCAAGCCG
AGAGAGCGCA GGCTTACTGA CGAAGAGAAG AGGTATTGGA AAAAAATTGC TGAAAATATT
TACAGGACAG TGAAGGATCA GTTTAATGAG GCTGACCTTC AGGAGATCAA TGATGAAGAT
AACGATGCAG CTCCAGAGGG CGAGGGTGCA GACGAAGGTT CTGAAAAGCA GTAA
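
GC content is the percentage of bases in a sequence that are G or C. The following is a minimal sketch, assuming the sequence is pasted in the space-separated 10-base blocks used above; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool, and the method the database itself used for its GC content field is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example input: only the first line of the gene sequence above.
first_line = "ATGGCTAAAA AGCAGGCAAA AGGGGAGACT CAGGATCGCA AGGCGCAGAT CGCAAGTGCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of the first line gives a value that can be compared with the GC content field.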
 
Protein sequence
MAKKQAKGET QDRKAQIASA FGEIEQSKVF LDKRTESAAV KMDIADLLKD IIQKQLRKDY 
DPEVEANIFI NPERGDFEVY ILKKIVDEID IPSIEISIDE VRQIDDSLEV GDYYEEGPIN
LEEYLTRKSI QIIKQSVQKK VRDLERQVVY EECLEKVGEV IAGEVYQIRP HEVIFSYNTS
KDHRVELVLP KSEMIKKDNP RRTPRMKLYV KRIERERVKV KQDDGSIVEK EKPDGGMKVI
VSRTDDRFLY KLFESEVPEI LDGLIVIKGI ARVPGERAKV AVESTSSRID PVGASVGYRG
KRIQSIVKEL NNENIDVIYY TDEPQIFIAR ALQPAKIDPM TVHADMKTRM ARVMLKPDQI
KYAIGKNGNN IHLAERLTGY EIDVYRDVID KTLEDPNDID IIEFREEFGD DMIYQLLDSG
LDTAKKVLQA GIEEIEQALL GPTKIDEMSV FGKGRKTVKP RERRLTDEEK RYWKKIAENI
YRTVKDQFNE ADLQEINDED NDAAPEGEGA DEGSEKQ
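
The protein sequence corresponds to a translation of the gene sequence with translation table 11. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs from it only in the permitted start codons, so a standard codon table is enough for a sketch. The code below is an illustration under that assumption; `translate` is a hypothetical helper, and it stops at the first stop codon and ignores any incomplete trailing codon.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence above.
print(translate("ATGGCTAAAA AGCAGGCAAA A"))  # MAKKQAK
```

The output matches the first seven residues of the protein sequence listed above.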