Gene Tery_1891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1891 
Symbol 
ID4242696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2899102 
End bp2902572 
Gene Length3471 bp 
Protein Length1156 aa 
Translation table11 
GC content45% 
IMG OID638107012 
Productpeptidase-like 
Protein accessionYP_721620 
Protein GI113475559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0870492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGA CTTTTCAAGT TGAATTTAGT GGCACCTGGG GAGAGGCCGA GCCAGGGCCA 
AACCCGAATC CTAAAATTTA TCCTACAGAT CCTCCGAATA CGTTTCGTTG GGGAGAGTCG
GTAGAGGAAG ACCTGAAACC AAACAGCTTA CTCTTTAATG CGGAAGACAT TAGCGGGGGC
ATTAAAACTG ACCAGCAGTT TGAAGTCGGA AAGTTAACCT TCTTTAACGG TTCAATTTTT
TCGGATACAG CGGTAGAGTC AGTACCGCTG AAGATTGAAT TATTAGATCC TTTTGAATTA
TTAGATTCTC CCATAACTTT TACTTTCAAT CTGGACTTAG ACACTACGGA GGATACCTCT
GACGAGATAG ATGACTGGTC TGATTTCGTC TATTTTCCCA AAGTCTTACC CACGGAAACC
TTCAAATTTG ATGGACAAAC ATACACCTTA GAGCTAACAG GCTTTAGCCA AGATGGTGGT
GACACCCTAG TAAACCGCTT TCGGGTGATG GAACAAGAAG TAGACATAGC TAGCTTGTTC
GCGAAAATTA GACTAGCACC CAGAGATATT GAGCGTCTTG AAGACCCCAC AGCAAAAACG
GACGGTGCCA TTAATCTTGG CGATCTTAAC AGCCGGAAGA ACTACCGCAA TACCGACGAA
ATCGGTTTTA ATGAAGGTGG TGTCCGTGAC TTGCAAGATT TCTATAAGTT TACTCTCAGC
AAGGATAGTG AGGTTGACAT CACTCTAGAC CAACTCAAGC GCAATGCTAA CGTTGAAATT
CTAGATGAGG ATGGTAGCAC AGTACTTTTC CAGTCAACTG AAGAGAACCG GAAGCGGGAA
AACATTACCG AAAATTTAGA ATCAGGTGAT TATTTCATCC GTGTTTATCC TGAAGGAGAT
GATCGCACAA AATACCGCTT AGGGGTAAGT GCTGATGCCC TGACAGATGA AAAGGACACA
ACTGATACCG CTAAAGAACT AGGTAATATC GGACTCGAAG AAGTAACCGA AATTGACAGA
ATAGGCTTCG GTCGGGGTAA AAACCGGGAC CAAGAAGATT ACTATAAGTT TGGTATCAAT
GAAAAGAGTG ACTTTTTCCT TACCCTAGAC CAGCTAAAGG GAAATGCTAA TGTTGAGGTT
TTAGATGGGG ATGGCAGCAC TATTCTCTAC CAGTCTAACA ATAGCGGTCG TAAAAGAGAA
AAGATTAACG AGGAATTAGA ACCCGGTGAT TATTTTGTGC GCGTAACTCC CCAAGGTGCT
GCCAGGACAG ACTATCGCCT GGGTCTGAGC GCTGATGTGC TTTCCAATGA ACAAGATGAC
CAGGAACCAG GTATTAGCTT GGGGGCAGTT ACGGAGCTTA CTCCCGTCAG CAAAACCGGT
AAACTTGGTG CTAACAAAGA CAGGGTAGAC TGGTACAACT TTTCTGTGCC GATAGAGAGC
GATGTCAACC TGACTCTAGA TAGACTAAGG CAAGACCTTA ACCTGGAAAT CTATGATGAG
GGTGGCGAGC TAGTTGATGA TGGTAAAAAT ACAGGCCGCA AAGCTGAAAA GATAGAACTT
GAAGGGCTGG AACCAGGAAC CTATAACATA AAAGTTTTCC CAAATGGTGG CGCCAAGAGT
AACTATCGCT TGGGTATAAC TGCGACTGCT CCTTATGTTG ATGACTATGC TAGTGTTGAA
GAAGCTTTAG ACTTCGGGAA TATCCCTATT GGGGAGACAA GGGTCTTCAA CGGCGAAATG
GGCCGTACTG AAGGTCGTTC CGGTCGTGAC ACAGAAGATT GGGTCAGCTT TACAATTGAT
GAAGAGAGCT TGGTTGACAT CGACCTGACT CGTCTGCGTC AAAACATAGA TATGATCTTG
TACGATGACG ACGGAACCTT TCTCAATAAT TCTCGGAACA AGGGTCGTAA ATCAGAAAAC
ATTGCTGAGA TATTGGAGGC AGGAACTTAT CATGTCCAGA TATTGCCAAA GGGAAACAGT
CGCAGCAACT ATCGTTTTTC GGTGAATGCT GAACCGATCC CAGAACCGAG ACAAGAGTTT
ACAGTTGGTG ACCTTTTGTC TTTGGAGGAT GGTTACAGTA TCAGGGGCGA GGAAATCGGC
TTTACCTCTG GTGGTATTCG TAACGTTATT GATCGGCACT TATTCAGCAT AAGCGACGAG
AGGAATGTAG AAATTGACCT CACAGGGCTG AAAAGAAATG CCAACATTGC CTTGTACGAT
GATGATGGCA CTTTATTGCT TGAGTCTCGG AAAGGCGGTA GAAGGAACGA AAATATTAGC
GACACTCTGG ATCCAGGCGA TTATTATGTG GATGTAGAGC CCCAGAACCA AGCTAAAACT
AAGTATAATT TGGACATTTT TGCAAGTGGT TCGAGCGTCG ATCCAGATGG TGGTCCTGTG
CTAGAGACTT CTTTGTACAA TGATATCGGT AACCTCACTG AAGATTACAG TAAGATAGAT
AATGTCGGTT TCGGCAGTGG TAGCAGTCGG GACGAAGTAG ACTACTACAA GTTTGAACTA
AGCGAAGACA AGAATCTGAC TATCTCCCTA AATAAACTGA GCGCTGATAT CGATCTGGAA
TTGCTTGATA GTTCTGATAC TTTGATCAAA GATTCCCGCA ATAAGAAGAA GAAAAACGAA
AAAATTGAAG AAGAGCTTGA ACCAGGTACT TACTATGTAG GGGTTGAACC CAAAGGTAAC
GCTCGTGGTA ACTATACCCT AAATATTAAG GTTCCTGAGC CAGGCAGTAG CGTGGATGAG
GACGGAGGCA AACTTCCAGA AAACGTCACC GACATCGGCG TGTTAACCTC ATACGAAGAG
GAGGACTCCA TCGGTCGTGA AGAAAACAGT TATCGTGATG TCAACGACTA CCGCAAGTTT
ACTTTGAGCG CTAAGAGTAG TGTCGATATT AACCTTACAG GCCTTACTGG TAACGCCAAC
CTACAGCTAA TTGATGGCGA CGGTAGCACT GTGCTAAATA CTTCTGCTAA TGGTGGTAGT
GACGACGAAA AAATCAACCT TACTTTGGAT GCGGGCGACT ATTGTGTGCG AGTGTTTCCC
AGGGGTGCGG CTAAAACTGA CTACACTCTG AATATGAGTG CGAGTGAAAT TGGTGAAAGC
ATAGACAATG AGCCACCAGG GATAGCTCTC GGTACGGTTA CAGTTGGTGC TGATCCTCTT
ACCCAAGGCG GTGACCTCGG CTTCACGGAG GGAGGCGTAG TTGACACCAA GGACTATTAC
AGCTTTGATA TTACCCAGGC TGGTTTCGTA GATATCAAAC TTGACGACCT GAGCGATAAC
GCTGACCTGA AACTATACGA TGAGACTGGC GAAGTAGAAC TTGGCAGTTC TAATAACTCC
GGCAATACTT CTGAGGAGAT TAATAGCTTC TTGAGTGCCG ATACTACCTA TGTGGTGGGT
GTATTCGGTC TAGGTAATCA AACTCCTTAC GACCTGAGCA TTTCTCTCTG A
 
Protein sequence
MPETFQVEFS GTWGEAEPGP NPNPKIYPTD PPNTFRWGES VEEDLKPNSL LFNAEDISGG 
IKTDQQFEVG KLTFFNGSIF SDTAVESVPL KIELLDPFEL LDSPITFTFN LDLDTTEDTS
DEIDDWSDFV YFPKVLPTET FKFDGQTYTL ELTGFSQDGG DTLVNRFRVM EQEVDIASLF
AKIRLAPRDI ERLEDPTAKT DGAINLGDLN SRKNYRNTDE IGFNEGGVRD LQDFYKFTLS
KDSEVDITLD QLKRNANVEI LDEDGSTVLF QSTEENRKRE NITENLESGD YFIRVYPEGD
DRTKYRLGVS ADALTDEKDT TDTAKELGNI GLEEVTEIDR IGFGRGKNRD QEDYYKFGIN
EKSDFFLTLD QLKGNANVEV LDGDGSTILY QSNNSGRKRE KINEELEPGD YFVRVTPQGA
ARTDYRLGLS ADVLSNEQDD QEPGISLGAV TELTPVSKTG KLGANKDRVD WYNFSVPIES
DVNLTLDRLR QDLNLEIYDE GGELVDDGKN TGRKAEKIEL EGLEPGTYNI KVFPNGGAKS
NYRLGITATA PYVDDYASVE EALDFGNIPI GETRVFNGEM GRTEGRSGRD TEDWVSFTID
EESLVDIDLT RLRQNIDMIL YDDDGTFLNN SRNKGRKSEN IAEILEAGTY HVQILPKGNS
RSNYRFSVNA EPIPEPRQEF TVGDLLSLED GYSIRGEEIG FTSGGIRNVI DRHLFSISDE
RNVEIDLTGL KRNANIALYD DDGTLLLESR KGGRRNENIS DTLDPGDYYV DVEPQNQAKT
KYNLDIFASG SSVDPDGGPV LETSLYNDIG NLTEDYSKID NVGFGSGSSR DEVDYYKFEL
SEDKNLTISL NKLSADIDLE LLDSSDTLIK DSRNKKKKNE KIEEELEPGT YYVGVEPKGN
ARGNYTLNIK VPEPGSSVDE DGGKLPENVT DIGVLTSYEE EDSIGREENS YRDVNDYRKF
TLSAKSSVDI NLTGLTGNAN LQLIDGDGST VLNTSANGGS DDEKINLTLD AGDYCVRVFP
RGAAKTDYTL NMSASEIGES IDNEPPGIAL GTVTVGADPL TQGGDLGFTE GGVVDTKDYY
SFDITQAGFV DIKLDDLSDN ADLKLYDETG EVELGSSNNS GNTSEEINSF LSADTTYVVG
VFGLGNQTPY DLSISL