Gene Tery_4058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4058 
Symbol 
ID4242086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6266294 
End bp6267253 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content40% 
IMG OID638108962 
ProductRNA polymerase sigma factor SigD 
Protein accessionYP_723543 
Protein GI113477482 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.284305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0136203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTG GTAAAACCTC AGACATAGAT TCTGTATGCA CTTATCTAAG AGAAATTGGT 
CGCGTTCCTC TATTAAGCCA CGAAGAAGAA ATTCTTTATG GAAAACAAGT TCAACGCTTA
ACGGCTTTGC AGGAATTTAA CCAAACGTTG GCAGAAGCTT TAGGCCGTGA ACCAACATTA
TTAGAATGGG CTGAGGCCAA GGAGCTGTCG AAGTCTGAAT TGCAGCGAAT TATTAGGGAA
GGCGAAAGAG CCAAGCGAAA AATGGTAGAG GCAAATTTGC GTTTGGTAGT ATCTGTTGCT
AAAAAATATA TCAAGCGAAA TGTAGATTTG CTTGATTTAA TTCAAGAAGG CACCATTGGT
ATGCAGCGAG GTGTAGAAAA GTTTGACCCG ACTAAGGGCT ATAGATTTTC TACTTATGCT
TATTGGTGGA TTCGTCAGGC TATAACCAGA GCAATAGCAG AAAAAGGCCG CACAATTCGT
TTACCTATCC ATATTACAGA AAAGCTAAAT AAGATTAAAA AAGTTCAGCG CCAACTGACT
CAACAGCTAG GACGTTCGGC TACTACTGGT GAAATAGCAG AAGAACTAGG TTTAACTCCA
AAACAAATAC GGGAATGCTT AGAACGGGCT CGCTTACCTT TATCTCTAGA TTTGCGAGTG
GGAGATAATC AAGATACAGA ATTAGGAGAC TTACTAGAAG ATACAGGAGC TTCTCCAGAA
GATTATGCTC TCCAATCTTC CATGCGAACT GATTTAGAAT CAATCATGGT TGACCTTACT
CCTCAACAAA AGCAAGTTTT GGCCTTAAGG TTTGGTTTAG AAGATGGCCA AACTATGACT
CTATCGAAAA TTGGTGCCCA CCTGAATATT AGTCGTGAAA GAGTACGGCA GATAGAGCGG
GAAGCTTTAA GTAAACTTCG TAAAAGAAAA CAAGATATGA ATGATTATTT AGCTAGTTAG
 
Protein sequence
MKIGKTSDID SVCTYLREIG RVPLLSHEEE ILYGKQVQRL TALQEFNQTL AEALGREPTL 
LEWAEAKELS KSELQRIIRE GERAKRKMVE ANLRLVVSVA KKYIKRNVDL LDLIQEGTIG
MQRGVEKFDP TKGYRFSTYA YWWIRQAITR AIAEKGRTIR LPIHITEKLN KIKKVQRQLT
QQLGRSATTG EIAEELGLTP KQIRECLERA RLPLSLDLRV GDNQDTELGD LLEDTGASPE
DYALQSSMRT DLESIMVDLT PQQKQVLALR FGLEDGQTMT LSKIGAHLNI SRERVRQIER
EALSKLRKRK QDMNDYLAS