Gene Tery_4416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4416 
Symbol 
ID4246069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6801445 
End bp6802677 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content39% 
IMG OID638109300 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_723877 
Protein GI113477816 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.464298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTT CAGATCAAAA AAATCCAGCT AGACCTCAAA TAGGTGTTTA TGTAGTCGCT 
ATAGCTGTAA GTACAGGTTT GACTTTAACT GCCATACGCG CTTTTCCTAG AATATTTCTA
CCCACAGATA ACAGGGAAAC GAGTCAAAAT AAACCTCAAA GTCAGCTAGT AGTAAATACA
AAAGTTCCTC AGATAGCACA AGTACCAATA AAAGCTGATA GTTTTGTCGC CACTGCGGTT
GAGAAAGTAG GACCGGCTGT CGTACGCATA GATACAGAAC GTACAGTAGC GCGTAATACA
CCCAATTTTT TTAATGACCC ATTTTTCCGT CGCTTTTTTG GAAATGATAG TTTTTCCCAA
GTTCCTAAGA AGTTTCAACA ACAGGGACAA GGCTCTGGTT TTATTACTGA TAGTAGTGGT
ATTATTTTGA CTAATGCCCA TGTTATTAAA GGTGCAGATT CAGTTACAGT TAAGCTTAAA
GATGGGCGGA GTTTTGAGGG AGAAGTAAGA GGTCTTGATG AACCTTCTGA TTTAGCAGTG
ATCAAAATTG ATGGGGAAAA TTTACCTGTT GCATTTTTAG GAAATTCTGC TCGGGTCAAA
GTCGGCGACT GGGCGATCGC TGTAGGAAAT CCCCTGGGGT TAGATAATAC GGTAACTTTG
GGTATTGTTA GTTCTCTAAA CCGCGCTAGT TCGGAAGTTG GTATCCCTGA TAAACGTCTT
GATTTTATTC AAACTGATGC TGCTATTAAT CCTGGTAACT CTGGAGGTCC TTTGGTAAAT
TCTCAGGGAG AAGTTATTGG TATTAATACA GCTATTCGTG CTGATGGGCA AGGTATCGGA
TTTGCTATAC CTATAGATGA GGCAAAGGTG ATTCAAGAAA AGTTAGTTAA AGGTGAAAGT
ATACCTCGTC CTTATATTGG GGTGCGGATG GTTACTTTGA CTCCAGAAAT TATTGAAAAA
ATTAATAAAA ATCCCAATTC CTCAATACAG TTGCCTGAGA CTGATGGTGT TTTAATCGCA
CAAGTAATTT CTAATAGTCC AGCAGCTAAA GGGGGTTTAC GACTTGGGGA TGTGGTTACA
GAAATTGATG GTCAAAAAAT TGCTACTGCT GAAGAATTAC AGAGTATAGT TCAGAAAGGT
CAAATTGGTA AACCTCTAAA TATTACGGTA AAACGTGGTA AAGAGACTCA AACGTTTTCT
GTGAGTCCAC AAGAATTACA GGATGCTAAT TAA
 
Protein sequence
MKISDQKNPA RPQIGVYVVA IAVSTGLTLT AIRAFPRIFL PTDNRETSQN KPQSQLVVNT 
KVPQIAQVPI KADSFVATAV EKVGPAVVRI DTERTVARNT PNFFNDPFFR RFFGNDSFSQ
VPKKFQQQGQ GSGFITDSSG IILTNAHVIK GADSVTVKLK DGRSFEGEVR GLDEPSDLAV
IKIDGENLPV AFLGNSARVK VGDWAIAVGN PLGLDNTVTL GIVSSLNRAS SEVGIPDKRL
DFIQTDAAIN PGNSGGPLVN SQGEVIGINT AIRADGQGIG FAIPIDEAKV IQEKLVKGES
IPRPYIGVRM VTLTPEIIEK INKNPNSSIQ LPETDGVLIA QVISNSPAAK GGLRLGDVVT
EIDGQKIATA EELQSIVQKG QIGKPLNITV KRGKETQTFS VSPQELQDAN