Gene Tery_1974 details


Gene Information

Locus tag: Tery_1974
Symbol:
ID: 4244398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3064492
End bp: 3069276
Gene Length: 4785 bp
Protein Length: 1594 aa
Translation table: 11
GC content: 44%
IMG OID: 638107091
Product: peptidase-like
Protein accession: YP_721698
Protein GI: 113475637
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
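
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are illustrative only and are not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3064492
END_BP = 3069276
GENE_LENGTH_BP = 4785
PROTEIN_LENGTH_AA = 1594


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3069276 - 3064492 + 1 gives 4785 bp, and 4785 / 3 - 1 gives 1594 aa, matching the listed gene and protein lengths.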


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.195376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTACGT GTACAGGAGT AGATTGGTCT GGCACGCCAG GAGACAATGT GTGGCCTTTA 
TCTGAGGATA ATTCTTGTAA TGACACACTG AATGGTAGAG GGGGTGACGA CAAATTGTAT
GCTAGCCAAG GTAATGACAT ATTGTATGGT GGTGATGGTG ATGACAAATT GTATGGTCAG
CAAGGTAATG ACAAATTGTA CGGCGAGCAA GGTAATGACA GATTGTATGG CAATGGAGGT
CGCGACAGAT TGTATGGCAA TGGAGGTTGT GACAGATTGC ATGGTGGTCG TGGTAAAGAC
ACCCTTAATG GAGGAGCCGG CAACGATACC CTTAATGGAG GAGCCGGCAA GGACAAAGTG
TGGAAAGAAG AGAAGGATCG CGTAGTACCT GGTGATTCTG ACGTAATTTT TCTTCTTACT
TCTACTGGTA AAGGTCATAA AAAGAAAATC AGCAAAAATT CTAGTCCTGT TTTGAGAAAC
CCCGGCGGCA GTAAGAACAC CATAGCAGAT GCCTTCGACC TGGAACTCAT CGAACTAGGA
CCGACTGTCA TAGAAGATAA GATCGGTTTT AGAGTAAATA GTCAACTTGA TAGGAACGAC
TACTACAAAT TCACTCTTGA CGAGGAAAGC GACTTTAACC TAACCCTAGA AAATTTAAGT
GCTAATGCTG ATGTCGAAAT ATTAGATAGT GATGGCTCAA CATTATTATT CAGCTCTGCA
AACTCTGGGA ATACAGATGA ACTTATCAAT GCGGAATTAG ATGCAGGAGA ATATTTTATC
CGCGTCTTAC CAAGAGGCGG AGCAGAAACA GACTATAGAT TGACTATTAA TGCTGAACCC
AAAAATTCTA GTGCTCTTCT GGACAGTGCG GATCTTTTTA TTGGTGGTGA AGGGGCTGAT
AAATTTGTCC CCTTTTATGA TCAGGATGGT ACTTATGGAC ATGACACCAT AGAAGATTTT
AATAAGAGTG AAGGAGATAA GATACTTCTA AATGGTAAAA ATTTTGAGGC TTTAAAAGAT
ATAACTATTG GCGAAACTCT ACCAGAATAC GAGTTTATAG CAATTCCAAA CTATGGCTGT
AGGGGGAACA CACCTCAGAC CTCAGAAACA ATTATCTATG ATCCTGTCTC TGGCCTTGTA
TACGCTAATC AAACAGAAAA GCCCAATGAT GAGGTCATAC TTACTCAATT GACTAATAAA
AGCTCATTAG AAAACACTGA CTTTGAAATT GTTGATTCTG AAACAATAAT CCCTCCCAAT
GACCCCCCAA TGGACTGCAC TAGTACAGGT CGACCTGATA GGAGTTTTGA ACGGGGTTTT
CAAGTTGAAT TTAGTGGCAC CTGGGGAGAG GCCGAGCCAG GGCCAAACCC GAATCCTAAA
ACTTATCCTA CAGATCCTCC GAATACGTTT CGTTGGGGAG AGTCGGTAGA GGAAGACCTG
AAACCAAACA GCTTACTCTT TAATGCGGAA GATATTAGCG GGGGCATTGA AACTGACAAG
CAGTTTGAAG TCGGAAAGTT AACCTTCTTT AACGGTTCAA TTTTTTCGGA TACAGCGGTA
GAGTCAGTAC CCCTGGAAAT TAAATTAGAA TTAGATACTT TAGAAGGTGA ACCACCAATA
AACAAAACTT TTACCTTCAA TCTGGACTTA GACACTACGG AGGATACCTC TGACGAGATA
GATGACTGGT CTGATTTCGT CTATTTTCCC AAAGTCTTAC CCACGGAAAC CTTCAAATTT
AATGGACAAA CATACACCTT AGAGCTAACA GGCTTTAGCC AAGATGGGGG TGACACCCTA
GAAAACCGCT TTCGGGTGAT GGAACAAGAA GTAGACATAG CTAGCTTGTT CGCTAAAATT
AGACTAGCAC CCAAAGATAT TGAGCGTCTT GAAGACCCCA CAGCGAAGCC GGACGGTGCC
ATTAATCTTG GCGATCTTAA CAGCCGGAAG AACTACCGCA ATACTGACGA AATCGGTTTT
AATGAAGGTG GTGTCCGTGA CTTGCAAGAT TTCTATAAGT TTACTCTCAG CAAGGATAGT
GAGGTTGACA TCACTCTAGA CCAACTCAAG CGCAATGCTA ACGTTGAAAT TCTAGATGAG
GATGGTAGCA CAGTACTTTT CCAGTCAACT GAAGAGAACC GGAAGCGGGA AAACATTACC
GAAAATTTAG AATCAGGTGA TTATTTCATC CGTGTTTATC CTGAAGGAGA TGATCGCACA
AAATACCGCT TAGGGGTAAG TGCTGATGCC CTGACAGATG AAAAGGACAC AACTGATACC
GCTAAAGAAC TAGGTAATAT CGGACTCGAA GAAGTAACCG AAATTGACAG AATAGGCTTC
GGTCGGGGTA AAAACCGGGA CCAAGAAGAT TACTATAAGT TTGGTATCAA TGAAAAGAGT
GACTTTTTCC TTACCCTAGA CCAGCTAAAG GGAAATGCTA ATGTTGAGGT TTTAGATGGG
GATGGCAGCA CTATTCTCTA CCAGTCTAAC AATAGCGGTC GTAAAAGAGA AAAGATTAAC
GAGGAATTAG AACCCGGTGA TTATTTTGTG CGCGTAACTC CCCAAGGTGC TGCCAGGACA
GACTATCGCC TGGGTCTGAG CGCTGATGTG CTTTCCAATG AACAAGATGA CCAGGAACCA
GGTATTAGCT TGGGGGCAGT TACGGAGCTT ACTCCCGTCA GCAAAACCGG TAAACTTGGT
GCTAACAAAG ACAGGGTAGA CTGGTACAAC TTTTCTGTGC CCATAGAGAG CGATGTCAAC
CTGACTCTAG ATAGACTAAG GCAAGACCTT AATGTCGAAA TTTACGATGA TGGTGGCGAG
CTAGTTGATG AGGGTAAAAA GACAGGTCGC AAAGCTGAAA AGATCGAACT TGAAGGGCTG
GAACCAGGAA CCTATAACAT AAAAGTTTTC CCAAATGGTG GCGCCAAGAG TAACTATCGC
TTGGGTATAA CTGCGACTGC TCCTTATGTT GATGACTATG CTAGTGTTAA AGAAGCTTTA
GACTTCGGGA ATATCCCTAT TGGGGAGACA AGGGTCTTCA ACGACGAAAT GGGCCGTACT
GAAGGTCGTT CCGGTCGTGA CACAGAAGAT TGGGTCAGCT TTACAATTGA TGAAGAGAGC
TTGGTTGACA TCGACCTGAC TCGTCTGCGT CAAAACATAG ATATGATCTT GTACGATGAC
GACGGCACAA CCACTCTCAA TAATTCTCGG AACAAGGGTC GTAAATCAGA AAACATTGCT
GAGATATTGG AAGAGGGGAC TTATCATGTC CAGATATTGC CAAAGGGAAA CAGTCGCAGC
AACTATCGTT TTTCGGTGAA TGCTGAACCC ATCCCAGAAC CGAGACAAGA GTTTACAGTT
GGTGACCTTT TGTCTTTGGA GGATGGTTAC AGTATCAGGG GCGAGAAAAT CGGCTTTACC
TCTAGTGGCG TTCGTAACCT TATTGATCGG CACTTATTCA GCATAAGCGA CGAGAGGAAT
GTAGAAATTG ACCTCACAGG GCTGAAAAGA AATGCCAACA TTGCCTTGTA CGATGATGAT
GGCACTTTAT TGCTTGAGTC TCGGAAAGGC GGTAGAAAGA ACGAAAATAT TAGCGACACT
CTGGATCCAG GCGATTATTA TGTGGATGTA GAGCCCCAGA ACTTGGCTAA AACTAAGTAT
AATTTGGACA TTTTTGCAAG TGGTTCGAGC GTCGATCCAG ATGGTGGTCC TGTGCCAGAG
ACTTCTTTGT ACAATGACAT CGGTAACCTC ACTGAAGATT ACAGTAAGAT AGATAATGTC
GGTTTCGGCA GTGGTAGCAG TCGGGACGAA GTAGACTACT ACAAGTTTGA ACTAAGCGAA
GACAAGAATC TGACTATCTC CCTAAATAAA CTGAGCGCTG ATATCGATCT GGAATTGCTT
GATAGTTCTG GTACTTTGAT CAAAGATTCC CGCAATAAGA AGAAGAAAAA CGAAAAAATT
GAAGAAGAGC TTGAACCAGG TACTTACTAT GTAGGGGTTG AACCCAAAGG TAACGCTCGT
GGTAACTATA CCCTAAATAT TAAGGTTCCT GAGCCAGGCA GTAGCGTGGA TGAGGACGGA
GGCAAACCCC CAGAAAACGT CACCGACATC GGTGTGTTAA CCTCATACGA AGAGGAGGAC
TCCATCGGTC GTCGAGAAAA GAGTTATCGT GATGTCAACG ACTACCGCAA GTTTACTTTG
AGCGCTGAGA GTAGTGTCGA TATCAACCTT ACTGACCTGA AGGGTAACGC CAACCTACAG
CTAATTGATA GCGACGGTAG TACTGTGCTA AATACTTCTG CTAATGGTGG TAGGAAGGAC
GAAAACATCA ACCTTACTTT GGAAGCGGAC GACTATTATG TGCGAGTGTT TCCCAGGGGT
GCGGCTAAAA CTAACTACAG TCTGAATATG AGTGCGAGTG AAATTGGTGA AAGCATAGAC
AATGAGCCAC CAGGGATAGC TCTCGGTACG GTTACAGTTG GTGCTGATCC TCTTACCCAA
GGCGGTGACC TCGGCTTCAC GGAGGGGGGC GTAGTTGACA CCAAGGACTA TTACAGCTTT
GATATTACCC AGGCTGGTTT CGTAGAGATC AAACTTGACG ACCTGAACGA TAACGCTGAC
CTGAAACTAT ACGATGAGAC TGGCGAAGTA GAAATTGGCA GTTATACTAA CTCCGGCAAT
ACTCCTGAAG AGATTTTCAC CTTCATTAGT GCCGATACTA CCTATGTGGT GGGTGTATTC
GGTCTAGGTA ATCAAACTTT TTACGACCTG AGCATTTCTC TCTGA
 
Protein sequence
MPTCTGVDWS GTPGDNVWPL SEDNSCNDTL NGRGGDDKLY ASQGNDILYG GDGDDKLYGQ 
QGNDKLYGEQ GNDRLYGNGG RDRLYGNGGC DRLHGGRGKD TLNGGAGNDT LNGGAGKDKV
WKEEKDRVVP GDSDVIFLLT STGKGHKKKI SKNSSPVLRN PGGSKNTIAD AFDLELIELG
PTVIEDKIGF RVNSQLDRND YYKFTLDEES DFNLTLENLS ANADVEILDS DGSTLLFSSA
NSGNTDELIN AELDAGEYFI RVLPRGGAET DYRLTINAEP KNSSALLDSA DLFIGGEGAD
KFVPFYDQDG TYGHDTIEDF NKSEGDKILL NGKNFEALKD ITIGETLPEY EFIAIPNYGC
RGNTPQTSET IIYDPVSGLV YANQTEKPND EVILTQLTNK SSLENTDFEI VDSETIIPPN
DPPMDCTSTG RPDRSFERGF QVEFSGTWGE AEPGPNPNPK TYPTDPPNTF RWGESVEEDL
KPNSLLFNAE DISGGIETDK QFEVGKLTFF NGSIFSDTAV ESVPLEIKLE LDTLEGEPPI
NKTFTFNLDL DTTEDTSDEI DDWSDFVYFP KVLPTETFKF NGQTYTLELT GFSQDGGDTL
ENRFRVMEQE VDIASLFAKI RLAPKDIERL EDPTAKPDGA INLGDLNSRK NYRNTDEIGF
NEGGVRDLQD FYKFTLSKDS EVDITLDQLK RNANVEILDE DGSTVLFQST EENRKRENIT
ENLESGDYFI RVYPEGDDRT KYRLGVSADA LTDEKDTTDT AKELGNIGLE EVTEIDRIGF
GRGKNRDQED YYKFGINEKS DFFLTLDQLK GNANVEVLDG DGSTILYQSN NSGRKREKIN
EELEPGDYFV RVTPQGAART DYRLGLSADV LSNEQDDQEP GISLGAVTEL TPVSKTGKLG
ANKDRVDWYN FSVPIESDVN LTLDRLRQDL NVEIYDDGGE LVDEGKKTGR KAEKIELEGL
EPGTYNIKVF PNGGAKSNYR LGITATAPYV DDYASVKEAL DFGNIPIGET RVFNDEMGRT
EGRSGRDTED WVSFTIDEES LVDIDLTRLR QNIDMILYDD DGTTTLNNSR NKGRKSENIA
EILEEGTYHV QILPKGNSRS NYRFSVNAEP IPEPRQEFTV GDLLSLEDGY SIRGEKIGFT
SSGVRNLIDR HLFSISDERN VEIDLTGLKR NANIALYDDD GTLLLESRKG GRKNENISDT
LDPGDYYVDV EPQNLAKTKY NLDIFASGSS VDPDGGPVPE TSLYNDIGNL TEDYSKIDNV
GFGSGSSRDE VDYYKFELSE DKNLTISLNK LSADIDLELL DSSGTLIKDS RNKKKKNEKI
EEELEPGTYY VGVEPKGNAR GNYTLNIKVP EPGSSVDEDG GKPPENVTDI GVLTSYEEED
SIGRREKSYR DVNDYRKFTL SAESSVDINL TDLKGNANLQ LIDSDGSTVL NTSANGGRKD
ENINLTLEAD DYYVRVFPRG AAKTNYSLNM SASEIGESID NEPPGIALGT VTVGADPLTQ
GGDLGFTEGG VVDTKDYYSF DITQAGFVEI KLDDLNDNAD LKLYDETGEV EIGSYTNSGN
TPEEIFTFIS ADTTYVVGVF GLGNQTFYDL SISL