Gene Tery_1956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1956 
Symbol 
ID4244380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3031977 
End bp3033209 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content37% 
IMG OID638107075 
ProductRNA polymerase, sigma 28 subunit 
Protein accessionYP_721682 
Protein GI113475621 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.672319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00257504 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATCACTA CCTCAGAAAT TTATCCTTAC ACTGAAGACA CACTGCAATC TAATAACTTG 
CAGAATGAGT GTTCTGAACA TCTAGGTCAA ACTATCAATG AATTAACAAT AGAGGATAAA
AAAGTAGATA GTTCTCAAAT AAACACTAAC TATAGTGGCA GAGATTCAGT CCGTTCATAC
CTCCAAGAAA TTGGGCGAGT TGATTTGCTT CAAAGAGATC AAGAAGTTTC TACAGCTCAA
AAAGTTCAAC GTTATGTACA TTTATTGGAG TTACGCAACA AAGCCGCTGA ACAAGAACAA
GAAATTCAGC GTTATGTTGA GATGATCAAG ATTCGTGATC GTCTGGCAGC TCGTTTAGGA
CATAAACCAT CATTTGAGCG GTGGCTAAAG GCAGGAAATT TATCTATGGC AGAACTGAAA
GAGATCATTG CTGTAGGAAA ACTTCGCTGG GCAAAGTTAA CAGGTTTAAA TGTAAAGGAA
CTTGAGCAAA TTCAAACCGA AGGCATTGAA GCGAGACATC AGATGATAAG TACTAACTTG
CGCTTAGTAG TCTCAATAGC AAAAAAGTAT CAAAATCGTG GTGTAGAACT TTTAGATTTA
ATCCAAGAAG GTACTTTGGG TTTAGAAAGA GCTGTTGAAA AATTTGATCC ACAACGGGGA
TATAGATTTA GCACTTATGC TTACTGGTGG ATTCGTCAAG GAATTACAAG AGCAATTGCT
ACTCAAAGTC GAACAATTCG TTTACCTATC CATGTTAGCG AAAAATTAAA TAAAATCAAA
AAAGTACAAC AAAAAATTTT TCAAGAAAAA GGATATACGG CTAAAGTTGA AGAAATTGCC
CGAGAATTAA AAATTACAGC TGCCCAAGTT CGAGAAGTGT TAGTAAAAAT TCCACACTCG
GTTTCCTTAG AAACTAAGGT CGGTAGGGAT AGAGATACTG AATTAGGGGA ACTTCTGGAA
ACTAAAGATG CTTCTCCAGA AGAAATGTTA ATACGAGAGT CTCTGGTACA AGCTTTAAAA
GAGTTACTAT TAGATTTAAC TCAACGAGAA AGGTATGTAA TTACTATGCG TTATGGCTTA
GAAGATGGTC GTGCCTGCTC CTTATCAGAA ATTGCGTCTG CATTGAAACT CTCTCGGGAA
CGAGTACGTC AAATTGAAGT CAAGGCTCTA CATAAGTTGC GTCAACCAAA GTTTCGTAAT
CAAATACAGG ATTATTTAGA ATCTTTGAAT TAA
 
Protein sequence
MITTSEIYPY TEDTLQSNNL QNECSEHLGQ TINELTIEDK KVDSSQINTN YSGRDSVRSY 
LQEIGRVDLL QRDQEVSTAQ KVQRYVHLLE LRNKAAEQEQ EIQRYVEMIK IRDRLAARLG
HKPSFERWLK AGNLSMAELK EIIAVGKLRW AKLTGLNVKE LEQIQTEGIE ARHQMISTNL
RLVVSIAKKY QNRGVELLDL IQEGTLGLER AVEKFDPQRG YRFSTYAYWW IRQGITRAIA
TQSRTIRLPI HVSEKLNKIK KVQQKIFQEK GYTAKVEEIA RELKITAAQV REVLVKIPHS
VSLETKVGRD RDTELGELLE TKDASPEEML IRESLVQALK ELLLDLTQRE RYVITMRYGL
EDGRACSLSE IASALKLSRE RVRQIEVKAL HKLRQPKFRN QIQDYLESLN