Gene Tery_2988 details

Gene Information

Locus tag: Tery_2988
Symbol:
ID: 4245104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4643691
End bp: 4644656
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 39%
IMG OID: 638108023
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_722616
Protein GI: 113476555
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
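
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(4643691, 4644656)
print(length_bp, protein_length(length_bp))
```

With the coordinates in this record the two functions reproduce the listed 966 bp and 321 aa.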


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.146033
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGG GAGGTTACTC CGTGGCGCAA TTTCATTACG AGTGTGTAGA GTCTAAAACT 
GACAAAGATC GAAGTCAGTA CGGAAAATTT ATTATAGAAC CACTAGCTCG TGGTCAAGGT
ACAACAGTAG GAAATGCACT AAGGCGGGTT CTGCTATCTA ACTTAGAAGG AACAGCTATT
ACATCAGTAA GGATAGCTGG TGTTAATCAT GAGTTTGCCA CAATTAAGGG GGTAAGAGAA
GATGTCCTAG AAATCCTACT GAATATGAAA GAGGTTGTTC TCAAAAGCTA TTCTGAGCAA
CCACAAATTG GCAGGCTGCG GGTAGAAGGA CCAGCGACAG TAACTGCTGA TCGATTTGAT
GTGCCTTCAG AGGTTGAAGT AATAGATCGC AGTCAATATA TTGCTACTCT ATCCCCAGGC
TCCATTCTGG AGATGGAATT TAGAATTGAG AAAGGAACTG GTTATAAAGC AGTAGATCGT
ACTCGTGATG ATGTGGCGAC TCTTGATTTT CTGCAAATAG ATGCTATTTT TATGCCAGTT
CGTAAGGTTA ACTATACGAT TGAAGATGCC CATATAGGTA GTTCTTTAGA ACAAGACCGA
CTAATTATGG ATATTTGGAC TAATGGCAGT TATACGCCTC AAGATGCCTT GAGTAATGCC
GCTGGTATTT TAATGAGTTT GTTTGAACCA CTCAAAGATA TTACCAATAT GGCAGATATT
CCATCAGGAG AAACGACAGA CCCAACAAGT CAGATCCCAA TAGAGGAATT ACAGTTATCA
GTTAGGGCTT ATAATTGTCT AAAAAGGGCT CAAATAAACT CAGTTGCAGA CCTGCTAGAT
TATAGTCAAG AAGATTTGTT AGAAATCAAA AACTTTGGTC AAAAGTCTGC AGAAGAAGTC
ATAGAAGCTT TACAAAAGCG CTTGGGAATA ACTTTACCTC AAGAAAAAGT AGCTAAAGCT
ACTTAG
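
The GC content reported for this gene can be rechecked directly from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as a string with the 10-base groups and line breaks still present. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Usage: paste the full gene sequence in place of this short prefix.
prefix = "ATGGAAAAGG GAGGTTACTC"
print(f"{gc_content(prefix):.1%}")
```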
 
Protein sequence
MEKGGYSVAQ FHYECVESKT DKDRSQYGKF IIEPLARGQG TTVGNALRRV LLSNLEGTAI 
TSVRIAGVNH EFATIKGVRE DVLEILLNMK EVVLKSYSEQ PQIGRLRVEG PATVTADRFD
VPSEVEVIDR SQYIATLSPG SILEMEFRIE KGTGYKAVDR TRDDVATLDF LQIDAIFMPV
RKVNYTIEDA HIGSSLEQDR LIMDIWTNGS YTPQDALSNA AGILMSLFEP LKDITNMADI
PSGETTDPTS QIPIEELQLS VRAYNCLKRA QINSVADLLD YSQEDLLEIK NFGQKSAEEV
IEALQKRLGI TLPQEKVAKA T