Gene Tery_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1371 
Symbol 
ID4245471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2102236 
End bp2104170 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content36% 
IMG OID638106544 
Productpeptidase S9, prolyl oligopeptidase active site region 
Protein accessionYP_721155 
Protein GI113475094 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAAA TAGCACCATT AGGTTCTTGG AACTCTCCCA TAACTACTGA CTTAATTCTA 
TCCGGGGCAA TTGGACTTAG TAGTATCACC ATTGATGGTA ATAATGTTTA CTGGATTGAA
GGACGACCTT CAGAGGGTGG ACGAAATGTA ATTGTGCGCT ATACTCCAGA TGGAAAGACA
ACTGATATTA CTCCTTCACC TTTTAATGTA CGCACCCGCG TCCATGAATA CGGCGGTGGA
TCCTTCTTAG TGGCAGATGA TACGATATAT TTTTCTAATT TCAAGGACCA ACGTCTTTAT
CGCCAAACAC CAGGTACAGA ACCTCAACCT TTAACCCCAT CAGCAGACCT ACGTTATGCT
GATGCAGTAA TAGATAAACA GCGCGATCGC CTAATTTGTG TTCAAGAAGA TCATACAAAG
GATGGGGAGC CTACTAATAG GATAGTTAGC ATTAACCTCA AAAATGGAGA AGATATCCAG
GTATTAGCAG AGGGTTATGA TTTTTATGCC TCACCACGTT TAAGTCCTGA TGGTTCCACG
CTTTGCTGGA TTAGTTGGAA TCATCCAAAT ATGCCTTGGG ATGGTACAGA GTTATGGGTG
GCTCAGGTGA ATACAGATGG TTTATTAGAT GAAAATAAGT TCGTGGCTGG AGGAAAAGAA
GAATCAATTT TTCAACCCCA GTGGTCACCA GATGGAATGT TATGTTTTGT GAGCGATCGC
TCTTTATGGT GGAATATTTA TCAAGTTTCA GGAATTACTG ACAAGGTAAA TTTAGATATA
TTGTATTCTT TAAATGCTGA GTTTGGAGTG CCACAATGGT TATTTGGAAT GTCTACTTAT
ACATTTACAG AAGCTAACAA AATCCTCTGC ACTTTTTCTC AAAATGGAAT TTGTAATTTG
GCAACTATTG ATACTACTAA AAAACATCTA CAGAAAATAG AAATACCATT CACTTCTATT
AGTTATTTAA CAGCAAAAAA TAACAAAGTT TGTTTTTTGG GAAGTTCCCC CACAGAAGCT
AGTTCAATTA TACAAATAAA TCTATCTACA GGTGATATTA ATATTTTAAA ACGTTCTACA
GATTTAAAAA TTGATTCTGG TTATTTATCT ATTCCTAAAA CAATAGAATT TCCTACAGAA
AATGGTAAGA CTGCTTATGG TTTATTTTAC CCGCCGACTA ACAAAGATTA CACAGAACCT
TTAGGAGAAA AACCACCTCT TTTAGTTAAA AGTCATGGTG GTCCAACAGC AGCAACTTCT
GGTAGTTTAA GCTTAAAAAT TCAATATTGG ACAAGTCGTG GTTTTGCTTT ACTTGATGTT
AATTATGGAG GTAGTACTGG ATATGGAAGA GAGTATCGGC AAAGACTCAA AAATAGTTGG
GGTATTGTTG ATGTTGATGA TTGTGTAAAT GGAGCTCAAT ATTTAGCTAA ACAGGGATTA
GTAGATAGTA ATCGCATGGC TATTTCTGGT GGTAGTGCTG GAGGTTATAC TACTTTATGT
GCCTTAACTT TTAAGGATGT ATTTAAAGCA GGAGCAAGTT ATTATGGAGT TAGTGATTTA
GAGGCTTTAG CAACAGATAC TCATAAGTTT GAATCCCGTT ATTTAGATGG ATTAATAGGA
CCTTATCCTG AGAAAAAAGA GATTTATAAA CAGCGATCGC CAATTAATTT TACTGAGAGT
TTGTCTTGCC CTGTAATTTT TTTCCAAGGA TTAGAAGATA AAATTGTACC ACCAAATCAA
GCAGAGAAAA TGGTAGAAGT TTTACAGAAA AAAGGATTGC CAGTGGCTTA TGTTGCTTTT
GAAGGAGAAC AACATGGTTT TCGTAGTTCC GAGAATATTA AACGTGCTTT GGATGGAGAA
TTTTACTTTT ACTCTCGTGT ATTTGGATTT ACACCTGCTG ATAATTTAGA GGAGCTAGAA
ATTATGAATC TTTAG
 
Protein sequence
MTQIAPLGSW NSPITTDLIL SGAIGLSSIT IDGNNVYWIE GRPSEGGRNV IVRYTPDGKT 
TDITPSPFNV RTRVHEYGGG SFLVADDTIY FSNFKDQRLY RQTPGTEPQP LTPSADLRYA
DAVIDKQRDR LICVQEDHTK DGEPTNRIVS INLKNGEDIQ VLAEGYDFYA SPRLSPDGST
LCWISWNHPN MPWDGTELWV AQVNTDGLLD ENKFVAGGKE ESIFQPQWSP DGMLCFVSDR
SLWWNIYQVS GITDKVNLDI LYSLNAEFGV PQWLFGMSTY TFTEANKILC TFSQNGICNL
ATIDTTKKHL QKIEIPFTSI SYLTAKNNKV CFLGSSPTEA SSIIQINLST GDINILKRST
DLKIDSGYLS IPKTIEFPTE NGKTAYGLFY PPTNKDYTEP LGEKPPLLVK SHGGPTAATS
GSLSLKIQYW TSRGFALLDV NYGGSTGYGR EYRQRLKNSW GIVDVDDCVN GAQYLAKQGL
VDSNRMAISG GSAGGYTTLC ALTFKDVFKA GASYYGVSDL EALATDTHKF ESRYLDGLIG
PYPEKKEIYK QRSPINFTES LSCPVIFFQG LEDKIVPPNQ AEKMVEVLQK KGLPVAYVAF
EGEQHGFRSS ENIKRALDGE FYFYSRVFGF TPADNLEELE IMNL