Gene Tery_1624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1624 
Symbol 
ID4242408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2481037 
End bp2482254 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content38% 
IMG OID638106765 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_721375 
Protein GI113475314 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTAT CTTTTAAACA ATTAACTTTG TATTTTTCTC TGTTGTCCAT TGGTACTGCT 
ACGGGATGGT TGGGTCATCA TTATCTCGAA GCTAATAAAT GGTCAAATGA CTCTGATGTA
ATATCTTCTG TAGTTAAGAA ACAAGCACAA CCATCTACTC CAAACTCTGG AAATAACCTA
GTTTCCTTTT CTCATCATAA TTTTATTGCC GAAGCAGTCA AAAAAGTTGG CCCATCAGTA
GTCCGTATTG ATGCAGCTAA AAAGTTAACA ACTGAAGCTC CAGAAGCTTT AAAGAATCCT
CTATTGAAAC GTTTCTTTGG GGAAAATTTG CCGGTTCCAG AAGAACGAAC TAAGCGTGGT
ACTGGGTCAG GGGTAATTAT TAGTTCTGAT GGCCGCTTAA TTACAAATGC TCATGTTGTT
CATGGAGCAA ATACGGTTAA GGTGACATTG AAAGATGGCC GGGTATTTGA TGGTGTGGTT
AAAGGGGTGG ACTCACTGAC TGATATAGCA ATAATTAAAA TTGAGGCCAC AGATTTACCA
GAGGTATCTA TTGGCAAATC AGAACAATTA ATTCCTGGAC AATGGGCGAT CGCTATTGGT
AATCCTTTGG GTTTGGACAA TACTGTAACA GTGGGAATTA TTAGTGCTAT TGGTCGCACC
AGTTCTCAAG TAGGTATTCC AGATAAACGA GTTCGCTTTC TTCAGACAGA TGCTGCAATT
AATCCTGGCA ACTCTGGTGG GCCACTTTTG AATGATCAAG GTGAAGTAAT TGGTATTAAT
ACAGCTATTA GAGCGAATGC TCAGGGGTTA GGGTTTGCTA TTCCCATAGA AACTGCAAAA
AGAATTGCTG ATGAATTATT TGTCTATGGG AAAATAGAGC ACCCATTTTT AGGTATTTCA
ATGGTTGATT TAACTCCTGA GGTCAAGGAT GAAATTAATA GAAAACTGGA TACGAAAATT
AAGGATAATC AAGGTGTAGT AATTATGAGA GTTATAGAAG ATTCTCCTGC ACAAAAAGCT
GGTTTACGTC AAGGAGATGT GATTCAAAAA GTAGGGGGAG TAGTAGTGAA AAGTCCAACA
GAAGTTCAAC AAGAAGTAGA AAAAAGTTTA GTAGGAAAAA ATTTGGCAGT GGAGGTAATT
CGTAATCGGA AAATTGCCAA AATTTTGGTT AAACCTGATG CTTTTCCTGA ACCACTTGAG
TTAGAACTAA AGGAATAG
 
Protein sequence
MALSFKQLTL YFSLLSIGTA TGWLGHHYLE ANKWSNDSDV ISSVVKKQAQ PSTPNSGNNL 
VSFSHHNFIA EAVKKVGPSV VRIDAAKKLT TEAPEALKNP LLKRFFGENL PVPEERTKRG
TGSGVIISSD GRLITNAHVV HGANTVKVTL KDGRVFDGVV KGVDSLTDIA IIKIEATDLP
EVSIGKSEQL IPGQWAIAIG NPLGLDNTVT VGIISAIGRT SSQVGIPDKR VRFLQTDAAI
NPGNSGGPLL NDQGEVIGIN TAIRANAQGL GFAIPIETAK RIADELFVYG KIEHPFLGIS
MVDLTPEVKD EINRKLDTKI KDNQGVVIMR VIEDSPAQKA GLRQGDVIQK VGGVVVKSPT
EVQQEVEKSL VGKNLAVEVI RNRKIAKILV KPDAFPEPLE LELKE