Gene Tery_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1947 
Symbol 
ID4244371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3018030 
End bp3019724 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content33% 
IMG OID638107067 
Producttetratricopeptide TPR_2 
Protein accessionYP_721674 
Protein GI113475613 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0106408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAA CTCATGCTAA ATCATCGGAA AAATTGACAT TTTTAACTAC ATTAAATACT 
AAAAATAATC AAACTATTTC CTCTTCTAAG GGAATTTTTA GCATCTTTTC TGCGTTCCCA
GAATATGACC GTGGTAACAG ACTGTATGAA ATGGGTAGAT ACGAGTCAGC TATTCCCTAC
TACGAAAATG CAGTGAAAAT AAAGCCAGAC TGGGCTATAG GTTGGTTAAA ACTAGCTGAG
GCCTTATCTA AATTGCAAAA GTATGAACAA GCAGTAGAGG CTTATAAAAG ATCCCTATCT
CTCAAACAAA ACGCTCATCA AGCTTGGCAT AGTTATGGAG TTGTATTATC TAATTTAAAG
CAGTATGAGC AAGCGATCGC TTGCTTTGAC AAAGCAATTA AAATTAATCC AAATGATTAT
CAATCATGGT TTAATAAAGC AATTATTTTA AGCGAATTAA AACAAGATTT ACCTGCGATA
TACTGCTACA AAGAAGCACT AAAAATACAA CCTATGAAGG GAGAAATTTG GTATGGTCAA
GGTCAAGCAT TATTAAATGT GCAAAAATAT GCTGAAGCAT TAGCAGCTTA TGATTGTGCT
GCGAAGCTGC AACCTGATAA TTATGATATT TGGTTTAAGA GAGGATTAGC TTTATTTCAA
ACTCAACGTT ATGCAGAAGC AGTTATCAGT TATGGCCACG CTATAGAATT ACAACCAGAG
AATTATCTAG GTTGGTTTAA CTTAGGTATT GCTCAAAGTA AACTACATAA ATATCACGAT
GCAGTCTCTT CTTTTAATAA GGCAATTAAA TTAAATCCTG ATGATTATGA AGCTTGGTAT
TATAAAGGAT TAGCTTTAAA AAATCATTGG AAAGAAGGAG GAGTTGCTTG TTTAGATAAG
GCAATTAATT TTAACCCTAA TTTACCAGAA ATTTGGATTA GTCGTGGTTA TATTTTATTA
GATTTATTTA AATATCGTGA GGCATTAGAG TCTTTTAATA AGGCAATTAC AATTAACTCT
AATTATCCCG AATCTTGGTT AGGTAGAGGT AAAGCATGGA TGGCTCTAGG TAAATATAAT
GAAGCTCTTA TTGCTTATGG TAATGCTGTT AGTATTGAGC CATATTTTTT AGAGGCTTGG
AATTGTCGAG GTGAAGCATT AGAAAGAGTC CAAAATTATG ATCAAGCATT GGCAGCTTAT
GACAAAGTGA TAAAAATGAG TTTTGAGCAA GGAGTTTCTG TTGCTAAAGT AGGTTTACAG
AGAGGAGCAG CTTTAGAAAA GTTAGAGCGA TATCCTGAAG CAATAGAAGC TTATAATTTG
GTAATTGAAA AACAACCAAA TAATTTTGAT GGTTGGTTAA ACCGGGGATT AAACTTAGAA
AAAATGGCAA ATTATGAAGA AGCTGTTTTG AGTTATAGTC GAGCTATTAG TATATGGCCT
AGTAATTATC AAGCTTGGTT ACAATTAGCT TTAATGCTGG AAAAATTAGA GAGGTTAGAT
GAAGCGATCG TTGCCTATAA CAAAATCATT TCTTTAAGGC CTGGTAATCA TGAAACTTGG
TTGAAAAGAG GATTAATTCT GGAAAGATTA GGATATGTTC AAGAAGCTGT TAGTTCTTAC
AAAATTGTAT TAGAAATTAA ACCTGACTAT CACGAAGCAA TTGAAAGAAA AAAACGATTA
GAATTAACTG TTTAA
 
Protein sequence
MSKTHAKSSE KLTFLTTLNT KNNQTISSSK GIFSIFSAFP EYDRGNRLYE MGRYESAIPY 
YENAVKIKPD WAIGWLKLAE ALSKLQKYEQ AVEAYKRSLS LKQNAHQAWH SYGVVLSNLK
QYEQAIACFD KAIKINPNDY QSWFNKAIIL SELKQDLPAI YCYKEALKIQ PMKGEIWYGQ
GQALLNVQKY AEALAAYDCA AKLQPDNYDI WFKRGLALFQ TQRYAEAVIS YGHAIELQPE
NYLGWFNLGI AQSKLHKYHD AVSSFNKAIK LNPDDYEAWY YKGLALKNHW KEGGVACLDK
AINFNPNLPE IWISRGYILL DLFKYREALE SFNKAITINS NYPESWLGRG KAWMALGKYN
EALIAYGNAV SIEPYFLEAW NCRGEALERV QNYDQALAAY DKVIKMSFEQ GVSVAKVGLQ
RGAALEKLER YPEAIEAYNL VIEKQPNNFD GWLNRGLNLE KMANYEEAVL SYSRAISIWP
SNYQAWLQLA LMLEKLERLD EAIVAYNKII SLRPGNHETW LKRGLILERL GYVQEAVSSY
KIVLEIKPDY HEAIERKKRL ELTV