Gene Tery_5059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_5059 
Symbol 
ID4246714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7719995 
End bp7721242 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content38% 
IMG OID638109861 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_724437 
Protein GI113478376 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.269801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA CAATGGGCAA AGTATTTTGG AAACAACCTT TAACTTATTT ATTACTACTA 
GCAGGAGCAG TCGGAGCATT ATTAGGTGAG CGACTAATAT TACAGACAAG TAGCTCTCCA
GAAAACTCAA GTCAGCTTAC TGAACTATCA GTGACTCAAT CTCTTAGCAA AACTGATAAT
ACTTCTAACT CAGAAAAGTC CACTTGGTTA CCAGTTAGAG CCCCCATTTC CAATAGCAAT
TTTATTGTAA ATGCAGTCCA AAAAGTTGGT CCAGCAGTAG TCAGGATTAA TGCTTCTCGA
GCTGTCAGCC AAAGACCTAA TATGTATGGA TTTAGGGTAC CAGAAGATTT CTATGGTTTT
GAATTACCTA GATCGCGTAA TAGTCCAATT GAGCAAGGAA CTGGTTCTGG TTTTATCATC
AGTTCTGATG GTAATATTCT TACAAATGCT CATGTTGTCG AGGGTTCAAC TACTGTAGAA
GTGGTCCTTA AAGATGGTCG TCGCCTTCAA GGTAAAGTTT TGGGCACAGA TTCTCTAACT
GATGTAGCAG TAGTTAAAAT TGATGCTGGT AGTCTTCCAA CTGTTAAGAT CGGAGATTCA
AATAATCTGC AACCTGGAGA ATGGGCGATC GCTATTGGCA ACCCCCTAGG TCTAGATAAT
TCTGTGACGG TGGGCATAAT TAGTGCCACA GGTCGTTCTA GTAATGATGT GGGTGTTCCA
GATAAGCGGG TAGGATTTAT TCAAACAGAT GCTGCAATTA ATCCTGGTAA TTCTGGTGGT
CCTCTGTTGA ATCAAAATGG TGAGGTAATT GGCATTAATA CAGCTATTAT TGATGGTGCT
CAAGGTTTAG GATTTGCAAT TCCTATTAAT AATGCTCAAC AAATTGCTAA ACAATTAATT
AAGGTAGGTA AAGCAGAACA CGCTTATTTA GGTATTGCTA TGCAAACTCT TACACCAGAA
CTTAAGCAAG AACTGAACCG AAATTTCAAT ACAAATATGT TTAGTGACCA AGGGGTATTA
GTAATACAAG TTGTTCCTGG TTCTCCTGCT GATAAAAGTG GTTTAAAACC AGGGGATATA
ATTCAAAGAA TTGATAATCA AACTATTACT ACATCTGAAA ATGTACAGCA AATTGTTCAG
AACAAAACAG TAGGTAGTTT GTTGGAATTA GAAATTAATC GGAATGGTAA AAGCTTGAAT
TTGGATGTAC GAACTGGAAA TTTACCACCT AGAAGATTCA GAGGATAG
 
Protein sequence
MKNTMGKVFW KQPLTYLLLL AGAVGALLGE RLILQTSSSP ENSSQLTELS VTQSLSKTDN 
TSNSEKSTWL PVRAPISNSN FIVNAVQKVG PAVVRINASR AVSQRPNMYG FRVPEDFYGF
ELPRSRNSPI EQGTGSGFII SSDGNILTNA HVVEGSTTVE VVLKDGRRLQ GKVLGTDSLT
DVAVVKIDAG SLPTVKIGDS NNLQPGEWAI AIGNPLGLDN SVTVGIISAT GRSSNDVGVP
DKRVGFIQTD AAINPGNSGG PLLNQNGEVI GINTAIIDGA QGLGFAIPIN NAQQIAKQLI
KVGKAEHAYL GIAMQTLTPE LKQELNRNFN TNMFSDQGVL VIQVVPGSPA DKSGLKPGDI
IQRIDNQTIT TSENVQQIVQ NKTVGSLLEL EINRNGKSLN LDVRTGNLPP RRFRG