Gene Tery_3978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3978 
SymbolnusA 
ID4244544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6152058 
End bp6153338 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content38% 
IMG OID638108894 
Producttranscription elongation factor NusA 
Protein accessionYP_723476 
Protein GI113477415 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAATGG TTAGTTTGCC TGGACTTAAA GAGTTAATCG GAAATATTAG TAAAGAACGT 
AATTTACCAA AACAAGCAGT TCAGACTGCT TTACGAGAAG CTCTATTAAA AGGATATGAA
CGCTACAGAC GAACTCAACG TGTAGATGGA GTCAACTTCA CAGATGATTA CTTCGAAAAT
TTTGAAATAG AACTGGATAT TGAAGAAGAA GGTTATCGAG TATTAGCTAC AAAAACTATT
GTTGAAGAAG TAACAAATCC AGATCATCAT ATAGCTCTCC AAGAAGTTTT AGAAGTAGCC
TCAGAAGCTC AGTTAAATGA TACAGTATTC TTGGATGTGA CACCCGAAAA AAATGAATTT
GGTCGCATGG CTGCTATACA GACTAAACAA GTATTAGCTC AAAAGCTACG AGATCAACAA
CGAAAAATGA TTCAAGAAGA ATTTCAAGAT TTAGAAGGAG AAGTTCTCCA AGCCAGAGTT
TTGAGATTTG AAAAACAGTC AGTTATCTTA GCTGTAAGTA GTGGATTTGG TAGACTAGAA
GTAGAAGCTG AACTGCCTAA AAAAGAACAA CTACCTAATG ATAACTACCG TGCTAATGCT
ACTTTTAAAG TCTATTTGAA GCGAGTTTGT GAAGGGTCAA CTCGTGGGCC TCAATTACTT
GTGTCTAGAG CTGATGCCGG CTTAGTGGTT TATTTATTTG AAAATGAAGT CCCAGAAATT
GAAGATGAAG TCGTAAGAAT TGTGGCAGTT GCTAGAGAGG CAAATCCCCC ATCTCGGCAT
GTTGGTCCCA GAACTAAAAT AGCAGTTGAT ACTTTAGAAA GGGATGTAGA TCCAGTGGGA
GCTTGCATTG GTGCAAGGGG CTCAAGAATT CAGGTAGTTG TGAATGAGTT GAGAGGTGAA
AAAATAGATG TAATTCGCTG GTCTCCAGAC CCTTCTATAT ATATAGCTAA TTCTCTTAGT
CCAGCTAGAG TAGATGAAGT TCGTTTAATT GATCCAGAGG AAAGGAGGTC TCATATTTTG
GTGTCTGAAG ACCAACTTAG TTTGGCTATC GGCAAGGAAG GACAAAATGT GCGTTTAGCT
GCTCGTTTGA CAGGGTGGAA AATTGATATT AAGGACACAA ATAGATATGA CCATGCTGAA
GAAGATAGCA AAGTTGCGGC TGAAGTCTCT CATCGTCAAG CGTTAGCTGA ACAAGAAGAA
AATAAAATTG AGGAATCAGA ATTAGAAGTA ATAGAAAATA CTTTCGACAA AAATTTTAAT
GAACCAGATG ATTCTTTTTA A
 
Protein sequence
MSMVSLPGLK ELIGNISKER NLPKQAVQTA LREALLKGYE RYRRTQRVDG VNFTDDYFEN 
FEIELDIEEE GYRVLATKTI VEEVTNPDHH IALQEVLEVA SEAQLNDTVF LDVTPEKNEF
GRMAAIQTKQ VLAQKLRDQQ RKMIQEEFQD LEGEVLQARV LRFEKQSVIL AVSSGFGRLE
VEAELPKKEQ LPNDNYRANA TFKVYLKRVC EGSTRGPQLL VSRADAGLVV YLFENEVPEI
EDEVVRIVAV AREANPPSRH VGPRTKIAVD TLERDVDPVG ACIGARGSRI QVVVNELRGE
KIDVIRWSPD PSIYIANSLS PARVDEVRLI DPEERRSHIL VSEDQLSLAI GKEGQNVRLA
ARLTGWKIDI KDTNRYDHAE EDSKVAAEVS HRQALAEQEE NKIEESELEV IENTFDKNFN
EPDDSF