Gene Tery_4663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4663 
Symbol 
ID4246317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7170231 
End bp7171403 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content41% 
IMG OID638109528 
Product30S ribosomal protein S1 
Protein accessionYP_724104 
Protein GI113478043 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0608752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00279875 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACAAATT TGAAAACACC AGCCAAGGAA GTCGGCTTTA CTCACGAAGA TTTTGCAGCC 
CTACTAGATA AATATGACTA TCACTTTAGT CCTGGAGATA TTGTAGCAGG GACAGTATTT
AGCTTAGAAC CAAGGGGAGC ATTAATTGAC ATTGGAGCTA AAACTGCTGC TTATATTCCC
ATCCAAGAAA TGTCAATTAA TAGAGTAGAA AACCCTGAAG AAGTTCTACA GTCTAATGAA
ACAAGAGAAT TTTTTATTCT CACAGATGAA AATGAAGATG GACAGTTAAC TTTGTCCATC
CGAAGAATAG AATATATGCG GGCCTGGGAA AGAGTCCGAC AATTACAAGC AGAAGATGCA
ACTGTGCGGT CACAAGTATT TGCAACAAAT CGTGGTGGGG CATTAGTTAG GATAGAAGGT
TTGCGTGGAT TTATTCCTGG TTCACATATT AGTACACGCA AACCAAAAGA AGATTTACTT
AATGAAGAAT TACCTTTAAA ATTCCTAGAA GTAGACGAAG AACGTAACCG TCTGGTTTTA
AGTCATCGTC GCGCCCTAGT TGAGCGGAAG ATGAACCGCC TAGAGGTAGG GGAAGTAGTG
ATAGGCGCAG TTCGTGGTAT TAAGCCTTAT GGAGCTTTTA TTGATATTGG AGGAGTAAGT
GGCCTACTCC ATATATCAGA AATTTCTCAT GACCATATAG AAACTCCTAG CAGTGTACTA
AAAGTTAATG ATGAACTGAA AGTTATGATC ATCGATCTAG ATGCTGACAG AGGTCGTATA
TCTTTGTCAA CAAAGCAACT AGAACCAGAA CCAGGGGCGA TGGTGAAAAA TCCACAAATG
GTATATGACC AAGCAGAAGA AATGGCTGCT AAGTTTAGGG AAAAAATGAT GGCTCCTCAG
GGAGGGGAAA CAGCAGTAGA AACACCTGAA GTTGGAGTAA TAGCAGAAGC GACAGAAGCA
CCGGAAGGGG TGGCAACAGC AGAAGCGCTA GAAGCACCGG AAGAGGTAGC AACATCAGAA
GCACTAAAAG TACCGGAAGA GGTAATAACA TCAGAAGCGC TAGAAGTGCC AGAAGAGTTA
GCAACAGAAG CATCGCAAAA AACAGAGGAG ACTGAGACTA CAGTAGAAAT AGCTATCGTA
GATGAAGAGA CATCAGCAGC AGCTTTAGAA TAA
 
Protein sequence
MTNLKTPAKE VGFTHEDFAA LLDKYDYHFS PGDIVAGTVF SLEPRGALID IGAKTAAYIP 
IQEMSINRVE NPEEVLQSNE TREFFILTDE NEDGQLTLSI RRIEYMRAWE RVRQLQAEDA
TVRSQVFATN RGGALVRIEG LRGFIPGSHI STRKPKEDLL NEELPLKFLE VDEERNRLVL
SHRRALVERK MNRLEVGEVV IGAVRGIKPY GAFIDIGGVS GLLHISEISH DHIETPSSVL
KVNDELKVMI IDLDADRGRI SLSTKQLEPE PGAMVKNPQM VYDQAEEMAA KFREKMMAPQ
GGETAVETPE VGVIAEATEA PEGVATAEAL EAPEEVATSE ALKVPEEVIT SEALEVPEEL
ATEASQKTEE TETTVEIAIV DEETSAAALE