Gene Tery_2553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2553 
Symbol 
ID4244855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3936845 
End bp3938164 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content40% 
IMG OID638107628 
ProductECF subfamily RNA polymerase sigma-24 factor 
Protein accessionYP_722227 
Protein GI113476166 
COG category[K] Transcription 
COG ID[COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAACA GATACAATAG AACTGCAAAT ATTGACCGGC TATTTTGGCA AGAGTGGCAA 
AAGCATCAAG ACTATCTCTA CCATTGCTGT GTCAAATGGA TGGGAGGTAA TTCTATAAAT
GCTGAGGATG CTCTGAGTAT GGCTATGTTG AAGGCTAGGG AAAAAGTACA AAAGTGTCAC
AAAACAATTG ATAACTTCAA AGCTTGGTTG GCAAAACTCA CCTATAACCT TTGTATGGAT
CTACTGAAGC AGTCTGCTCG CTATCATCAA AAGGTTGAGG ATCTAGACTT GGTTCTATCT
CGCGCTGATG GGAGGACTCA AAGAGGAGAT CCATTTTTTG CTGTTGCCTA CGGAGAGTTA
GAAGATTTTT GTCGCTTGGC TATTGACAAT TTGCCAAAGA GACTACGGGA AACTTTTGCT
CTCTTTTTTA AAGAATACTC TTATAAGGAA ATAGCTACAG AGTTAAGTAT TTCTGAGCCT
AATGTCCGTA AGCGTATTTC CCAGGGGCGG GCTATTTTGC GAGAAAGGTA TGAGGAATAC
CAGAAACAAA AAGAAATAGT TATTTTTGAG GAGCACAAAG TAGAAAATTC TCCAGCTCAA
GAGTTGGAAA CAGAAATTAT TGCTACTGAG ATGCCCCAAG AGGCTGTTTT ATCTGAAGAG
AAAAGTGAGC CTATTTTAGT AGAGGCAACG GCTGAGAAGG AGTTAGGGAA AATAGAGACT
GTTGGCTATA GGAAACAAGA GCTTGTTGCT CCATCAGTGT TAGTGAAATC ACTTAGGGAC
GCTAAGAAAA AGCACTTAGT AGAGGCAACG GCTGAGGAGG AGTTAGGGAA AATAGAGACT
GTTGGCTATA GGAAACAAGA GCTTGTTGCT CCATCAGTGT TAGTGAAATC ACTTAGGGAC
GCTAAGAAAA AGCACTTAGT AGAGGCAACG GCTGAGGAGG AGTTAGGGAA AATAGAGACT
GTTGGCTATA GGAAACAAGA GCTTGTTGCT CCATCAGTGT TAGTGAAATC ATTTAAGTAT
GCTAATAAAA GGTACAAAGA TAAGACACAA CACAAATGTG GTCTACTGAG TACATGCACA
AATATAGTAC CGGTATTGCT GAGGGGCAGG ATGAATATAA GTATAGTCGG CTTCCTGCTA
GCCCAACAAT ATAAGGAACC GGCCAGGGGG TTAATCCACA AAGGTCTAGA AGACATAACA
CATTTATACA ACTTTGGCAA CAGAAAAATA AAAGCTTTAG ACAAACAGTT TAATTGGTTA
GCAGCGGATA ATTTGCTAAG GCTTTCAGCA GAGAAACTGA GGTACAAATT CTGCTGTTAG
 
Protein sequence
MLNRYNRTAN IDRLFWQEWQ KHQDYLYHCC VKWMGGNSIN AEDALSMAML KAREKVQKCH 
KTIDNFKAWL AKLTYNLCMD LLKQSARYHQ KVEDLDLVLS RADGRTQRGD PFFAVAYGEL
EDFCRLAIDN LPKRLRETFA LFFKEYSYKE IATELSISEP NVRKRISQGR AILRERYEEY
QKQKEIVIFE EHKVENSPAQ ELETEIIATE MPQEAVLSEE KSEPILVEAT AEKELGKIET
VGYRKQELVA PSVLVKSLRD AKKKHLVEAT AEEELGKIET VGYRKQELVA PSVLVKSLRD
AKKKHLVEAT AEEELGKIET VGYRKQELVA PSVLVKSFKY ANKRYKDKTQ HKCGLLSTCT
NIVPVLLRGR MNISIVGFLL AQQYKEPARG LIHKGLEDIT HLYNFGNRKI KALDKQFNWL
AADNLLRLSA EKLRYKFCC