Gene Tery_0385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0385 
Symbol 
ID4241619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp597558 
End bp598646 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content40% 
IMG OID638105712 
ProductECF subfamily RNA polymerase sigma-24 factor 
Protein accessionYP_720326 
Protein GI113474265 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAACG AACAGAAAAC AACTAGAAAT ATTGATTCAC TATTTTGGCT AGAATGGCAA 
AAGCATCAAG AGTATCTCTA CCGTTGCTGT GTCAAATGGA TGGGAGGTAA TTCTACAAAT
GCTGAGGATG CTTTAAGTAT GGCCATGCTC AAGGCTAGGG AAAAAATACA ACAGTCTTCC
AGAAGAATTG AGAACTTAAA ACCTTGGCTG GCTAAACTAA CCTATAACCT TTGTATGGAT
CTACTGAAGG AGTCTGCTCG ATATAATCAA GGGGTTGAGG ATCTAGACTT GGTTATTTCT
GGCGCTGATG GGAGCACTCA AGGAGGAGAT CCATTTTTTG TTGTTGCCTA CGAAGAACTA
AAAGATTTTT GTAACTTGGC CATTGATGAT TTGCCAAAGA GACTACGAGA AACTTTTGTT
CTCCTTTATC AGAAACAATT CTCTTCTCAA GAAATAGCTG CAGAGTTGAA TATTTCTGAC
TCTAATGTCC GTAAGCGTAT TTCCCGGGGG CGGGCTATTT TGGAAAAAAG GCGCCAGAAA
GAGTACGAAA AACAAGAAGA AATAGTTATT GTGGAGAATC AAAAAATAGA AAGTTCCCAA
CCTCAAGAGT TGGATACAGA AATTGTTACT GCTGAGATAC CCCAAGAGAC TATTTTATCT
GAAGAGAAAA GTGAGCCTAT TTTAGTTGAG GCCACGGTTG AGGAGGAGTT ACGGGAAATA
GAGACTTTTG GCAATGGAGG ACAGCAGCTC TTTGCTCTAT CTGTATTGTT GAAATTACTT
AGGGTAGAGA ATAAAAAGCC CAAGGGTGAG AAACAACAAA AATGTGGTCT ACTTAGTAGA
TGCACAAATA TAGGACTGGC ATTGCTGAGG GGCAGGATGA ATATAAGTGG AGTCCGCATC
CTGCTAACCC AAAAATATAA GAAACGGCTC AAATGGCTCA CCCACAAAGT GATAGAGGCT
CAACCACATT ATTTATACAA TTTTGGCAAG GGGGAACTAA AAGCTTTAGA AAAACAGTTG
AATTGGTTAA TAATGGAGAA TTTCCTAGGG CCTTCAGCAG CAAAACTGGT ATATAAAGAC
TGCTGCTAG
 
Protein sequence
MLNEQKTTRN IDSLFWLEWQ KHQEYLYRCC VKWMGGNSTN AEDALSMAML KAREKIQQSS 
RRIENLKPWL AKLTYNLCMD LLKESARYNQ GVEDLDLVIS GADGSTQGGD PFFVVAYEEL
KDFCNLAIDD LPKRLRETFV LLYQKQFSSQ EIAAELNISD SNVRKRISRG RAILEKRRQK
EYEKQEEIVI VENQKIESSQ PQELDTEIVT AEIPQETILS EEKSEPILVE ATVEEELREI
ETFGNGGQQL FALSVLLKLL RVENKKPKGE KQQKCGLLSR CTNIGLALLR GRMNISGVRI
LLTQKYKKRL KWLTHKVIEA QPHYLYNFGK GELKALEKQL NWLIMENFLG PSAAKLVYKD
CC