Gene Tery_2617 details

Gene Information

Locus tag: Tery_2617
Symbol:
ID: 4244685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4041422
End bp: 4043224
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 41%
IMG OID: 638107686
Product: GUN4-like
Protein accession: YP_722285
Protein GI: 113476224
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
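
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4041422
END_BP = 4043224
GENE_LENGTH_BP = 1803
PROTEIN_LENGTH_AA = 600


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4043224 - 4041422 + 1 = 1803 bp, and 1803 / 3 - 1 = 600 aa, which matches the listed lengths.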


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000986346
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.185793
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCTC TACTGCCTAT TATCAGTATA ACTTTCTTAT CAATAACTCT GACAAATTTT 
TCTTCCTCTG TCAAAAAAAC AGAGGTCGAA GACATAGCTT CTCGACTAGA GGATAGTATT
GTTAAACTAT CTTATAAAAA TCAACCAGGA GATGGAACTG GTTTTTTTGT AGAGGTGGAA
GGAAAGTCAG AAGTCTGTAG TGTACTAACA GCAGCTCATG TTGTGGAAAA AGAAGGACAA
AAAATCTTAT GGACTGAAAA AGATGAAAAA GTCTGGGATG TTGCTACGGT AGAAAGATTT
CCTGGTAGTA TAGACTTGGC TTTGATTACT TTTAAGCCAC ACACAAAAAG ATGTAATTAT
CCCGCACTAA AAATAGGTAA ACCAGAAAAC CTGAAGATAG GTAGTTCTAT TTTTGTTTAT
GGTTTTCCTC GTCTGGATCA ACATTTAGTG GCCCAGTTTG TTGGGGGTCA GGTTTCAGCT
TTGAAGAAAA AAGCCCGGGG TTATGGGGTT GCTTATAAAG CTTTGACTGT TGAAGGAATG
AGTGGAGCCC CTGTTGTGGA TACAAAGGGT GAGGTTGTGG CAGTCCATGG AAGTAGTAAC
TCTAAAATGG TGCCAAGTTT GATATCTCAG CAGATAAGTA GGCCTGATAT AGAATGGCAG
CTTGATCGGC AAACTTTCAA TAGAATTAAT AATAGTAGTT TGACCTGTGC TTGGGGTGTA
CCTATTAATT ATTTTCGGGA GTCTAAGTTT TATAATGCTA AGTTATATAG TAATCTTCTA
CCTTTAGACT TGAATAAGTG GATACTTTCG ATATTTTCCA TTAGTGGGTT GATGTTTAGC
TCTGGTATTG TTTCTTTTCG GTTCAAGCGT TTTCAAGCTT CACCAGTTTC GGGACAAGAA
CAAAATGAGC AGGAAAGAGA GTTTGAGGAT GTAGTATTTA GGGGAGAAGA AGAGCACAGA
CCGTTGAGTT CCCCGGCAAA TATTCCGACT CAGCTACAAA ATGAGCAGGA AAGAGAGTTT
GAGGATGTAG TATTTAGGGG AGAAGAAGAG CGCAGACCGT TGAGTTCCCC GGCAAATATT
CAGACTCAGC TACAAAATGA GCAGGAAAGA GAGTTTGAGG ATGTAGTATT TAGGGGAGAA
GAAGAGTACA GACCGTTGAG TTCCCTGGCA AATATTCCGA CTCAGCTACA AAATGAGCAG
GAAAGAGAGT TTGAGGATGT AGTATTTAGG GGAGAAGAAG AGCACAGACC GTTGAGTTCC
CTGGCAAATA TTCAGACTCA GGAGCAGCAA AGGCAAGGAG GGGTTCAGGT TCTTGAACCT
CCGTCTTCTG TGGTTTCTGT GCCTCTGGTT TCTGCAGCTG GAGTTGATTA TACTAGGTTG
CATGAGTTAT TGGTGGCTAA AAGGTGGAAG GAAGCAGACG ATGAAACATA TCAAAGAATG
ATAGAAGTGG CGGACCGGAA GTCCCAAGGA TGGTTGAGAA TTGAGGATAC AAAGAATTTT
CCTAGTCAAG ATTTAGGGAT TATTGATAAG CTATGGCTCA GATATAGTAA TGGTATGTTT
GGTTTTTCTG TTCAGAAGCA AGTTTATCAG AGTTTGGGTG GTACCCAGAG GTATAATCCA
AAAGTAGTAG AGGATTTCGG AGATAAGGTG GGATGGCGCC TGGAGGGAAA ATGGTTGAGT
TATGATGGTT TGACTGTGAG TGATAATTAT TACAGGGGAC ACCTGCCGTG TTGTGGGAAT
GAGGGGTCTT TGTATGGCTG GGCAGCAGTT CTTTGGTCTC TTCTCTCTCA TAAAGATTTG
TAA
 
Protein sequence
MKSLLPIISI TFLSITLTNF SSSVKKTEVE DIASRLEDSI VKLSYKNQPG DGTGFFVEVE 
GKSEVCSVLT AAHVVEKEGQ KILWTEKDEK VWDVATVERF PGSIDLALIT FKPHTKRCNY
PALKIGKPEN LKIGSSIFVY GFPRLDQHLV AQFVGGQVSA LKKKARGYGV AYKALTVEGM
SGAPVVDTKG EVVAVHGSSN SKMVPSLISQ QISRPDIEWQ LDRQTFNRIN NSSLTCAWGV
PINYFRESKF YNAKLYSNLL PLDLNKWILS IFSISGLMFS SGIVSFRFKR FQASPVSGQE
QNEQEREFED VVFRGEEEHR PLSSPANIPT QLQNEQEREF EDVVFRGEEE RRPLSSPANI
QTQLQNEQER EFEDVVFRGE EEYRPLSSLA NIPTQLQNEQ EREFEDVVFR GEEEHRPLSS
LANIQTQEQQ RQGGVQVLEP PSSVVSVPLV SAAGVDYTRL HELLVAKRWK EADDETYQRM
IEVADRKSQG WLRIEDTKNF PSQDLGIIDK LWLRYSNGMF GFSVQKQVYQ SLGGTQRYNP
KVVEDFGDKV GWRLEGKWLS YDGLTVSDNY YRGHLPCCGN EGSLYGWAAV LWSLLSHKDL
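
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the Gene Information fields. The following is a minimal sketch of that translation and of a GC-content calculation, assuming the sequence is pasted in with the 10-base spacing shown in this record. The helper names (`clean`, `translate`, `gc_fraction`) are illustrative. The codon table is built from the amino-acid string that NCBI publishes for table 11, with bases ordered T, C, A, G.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop.

    Alternative start codons are not handled specially; this gene starts
    with ATG, so that does not matter here.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAATCTC TACTGCCTAT TATCAGTATA"))  # MKSLLPIISI
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence above, and `gc_fraction` can be compared with the listed GC content.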