Gene Tery_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2471 
Symbol 
ID4245240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3799246 
End bp3803691 
Gene Length4446 bp 
Protein Length1481 aa 
Translation table11 
GC content42% 
IMG OID638107555 
Productpeptidase C14, caspase catalytic subunit p20 
Protein accessionYP_722154 
Protein GI113476093 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.420848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAA AAAATTTAAG GCAAGCTTTA GTCGTTGGAA TTAACCGTTA TCCTTTGTTG 
AAAAAGAAAA AGTTAGGAGA TTTGAACCTT AAGGCTGCTG TTAAGGATGC TGAAGCTATT
GCTAATATTT TGGAAAAATA TGGTAAGTTT CGAATTCAAC GTTTGCCAAG TTTGCCAAGT
TTGCCAAAAA ACTATGATCA GGAAGGTACA GAAAGATTTG ACCCTAAGGG AAAGGTGAAA
ATTAATGAGC TTCAAGAAGC AATTATTAAT CTATTTAAAC CTCGTAAAAA AAATGAAACT
CCTGATGTGG CTTTACTGTT TTTTGCCGGA CATGGTTATG TGGATGAAAA GGGAGATATT
CGAGAAGGTT TTTTGGCTAC TAGTGAGGCT CACCTTTCGG AAAATGTTTA TGGTATTTCT
TTAAATTGGT TAAAAAGGCT CTTACAAGAT AGTCCGGTTC AAGAACAAAT TGTTTGGTTA
GACTGTTGTT TTAGTGGTGA ATTCTTGAAT TTTGACCGAG AAGCTAACCC TGGTACTGAA
GGAAAAAAAA TTAGTCGTTG TTTTATTACT GCTTCTCGTT CTTTTCAAAC TGCGGAAGAA
AAATTGGATG GAAAACATGG TTTATTTACT GATAATTTGC TGGCTGGTTT AAACCCAGAA
AATTATGTTG ATGGTTGGGT AACTAATTAT GTTTTGGCAG AGTTTATTAA CAAAAAGATG
TCTCGTACAT CTCAAGCACC GATGTTTCAG AATTCTGGGG ATGCGATAAT TCTTACTACT
AATACTCTGA CTGAATATAA GGATGAACGC TGGAAAAATT TGGCTCCTTA TCGAGGTTTA
TCTTATTTTC GTCAGCAGGA AAATGATGCG GTTTTTTTTC ATGGCAGAAC TCTTTTAACT
GATGAGTTAA TTGACCGAGT TAGAACTAAT AATTTTGTGG CGGTTTTGGG GGCTTCTGGA
AGTGGAAAGT CTTCTTTATT GCGGGCGGGT TTGTTATATC AACTGAGGCA GGGGCAAAAA
ATATCTGGAA GTGATAGATG GCGTTATCTA AATCCTTTTA CTCCGACTTT TTCTCCTCTG
AAAAGTTTGG AACTAGCAAT TAATAGAGAG GGGGAAAAAC AGGAGAATTT TACTGATAAT
TTTACTGATA ATTTAATTAG GTTTATTGAC CTAGTTGAAG CTGAAAGAGT GGTAATGATA
ATTGACCAGT TTGAAGAGGT TTTTACTCTC TGTCAAGGTG ACGAGGAAAA AGAACAAGAA
CGACTGGATT TTTTTGATTG TTTTTTGGAT GTTCTGGAAA GACGAGGTGA TAAGTTTTGT
TTGGTGTTGG GAATGCGGGC TGATTTTCTT GACCGCTGTT CTGAATATGG AAGGTTAGCT
AATCAAATTA AGCGGCATCA ATTATTAGTA ACTCCTCTGG AAAAGGATGA AATAGATGAG
GTTATAAAAA AGCCTGCGGA ATTAGTTGGA GTTGGAGTTG AGCCGGGATT AATTGCCCAA
ATAAGGGAAG ATTTTTTGCG TAACCCTGGT AGTTTACCTC TGCTGGAATA TACTTTGGAT
GCTCTGTGGA AGTTTGCGAC TCAAGGGGAG AATAAAAGTC AATTTTTAAC TCTGGCAACT
TATACAAAGT TGGGTGGAAT TAAGGGTACT TTGACGAAGC GAGCTGATGC AGTTTTTCAG
AGTTTGAATG ATGAGGAAAG GTCGGTGGCA AAGAGGATAT TTTTAGAGTT AGTTCAGCCT
GGGGAAAAGG AAATAAGTTC GGGGAAAATA ACGGATACTC GGCGGCGAGT AATTTTGGAA
AAGTTGCCTA ACAAAAGACA TAGTTTAGAG CTTTTATCTG CAGTTAGTGA TAGGTTAGCT
GATCCAAATA ATCGGTTAAT TACTAAGGAT AATTCCGAGG GAGGAATATT ATTAGATATT
GTTCATGAGG ACTTAATTAG AAGTTGGAAA ACTTTGAGGG AATGGGTGGA AGAATATCAA
GAAGCTTTGC CGGTGGAAAG AAAAATTGAG GCTGATGCGG CTGAGTGGAA AAAAGATGGA
AAAAATGAGG GTTTGTTATT ACGAGCTGGT CGGTTAACTA AGGCAGAAGA ATATCTGAAA
AAATATGATG AAATGGCTTT ATTGGATGGG GTTGCTTATG AGTTTATTGA GGCGAGTCGG
GAGTTGAAAA TTCGTGAGGA AGAAAAAGAG AAGGAGCGGC AAAGAAAAGT AGAGGAACAA
GCAGCAAGAA TATTAGGAAT GCTTTCTGAC TCAATGATCA GACAAAAGCC ATCTTTACTT
GATAAAGGTG TATTACTGGG CATAGAGTCA ATGAAGCAAT ACTTTGATAT CAAAAAACGT
TATGGGAAAG TTGACTCGGA CCTACTTTTT GAACTGGATC AAACTCTACG GAATGGAGTA
AGTCAACTAC CCAAGCATCT CTATACTCTC AAACACCAGT CCGATGTATA TGCAGTAGCC
TTTAGCCCCG ACGGCAAAAC CATTGCTACT GCAAGTTATG ACAAAACCGC CCGCCTCTGG
GATACTGAGA ATGGCAAAGA ATTAGCTACT CTCAAACACC AGTCCGATGT ATATGCAGTA
GCCTTTAGCC CCGACGGCAA AACCATTGCT ACTGCAAGTT CTGACAAAAC CGCCCGCCTC
TGGGATACTG AGAATGGCAA AGAATTAGCT ACTCTCAACC ACCAGTCCTC GGTAAATGCA
GTAGCCTTTA GCCCCGACGG CAAAACCATT GCAACTGCAA GTTCTGACAA AACCGCCCGC
CTCTGGGATA CTGAGAATGG CAATGTATTA GCTACTCTCA ACCACCAGTC CTCGGTAAAT
GCAGTAGCCT TTAGCCCCGA CGGCAAAACC ATTGCAACTG CAAGTTCTGA CAAAACCGCC
CGCCTCTGGG ATACTGAGAA TGGCAAAGAA TTAGCTACTC TCAACCACCA GTCCTCGGTA
AATGCAGTAG CCTTTAGCCC CGACGGCAAA ACCATTGCAA CTGCAAGTTC TGACAAAACC
GCCCGCCTCT GGGATACTGA GAATGGCAAA GAATTAGCTA CTCTCAACCA CCAGTCCTGG
GTAAATGCAG TAGCCTTTAG CCCCGACGGC AAAACCATTG CAACTGCAAG TTCTGACAAA
ACCGCCCGCC TCTGGGATAC TGAGAATGGC AATGTATTAG CTACTCTCAA CCACCAGTCC
TCGGTAAATG CAGTAGCCTT TAGCCCCGAC GGCAAAACCA TTGCAACTGC AAGTTCTGAC
AAAACCGCCC GCCTCTGGGA TACTGAGAAT GGCAAAGAAT TAGCTACTCT CAACCACCAG
TCCTCGGTAA ATGCAGTAGC CTTTAGCCCC GACGGCAAAA CCATTGCAAC TGCAAGTTCT
GACAAAACCG CCCGCCTCTG GGATACTGAG AATGGCAAAG AATTAGCTAC TCTCAACCAC
CAGGACACGG TAAGAGCAGT AGCCTTTAGC CCCGACGGCA AAACCATTGC TACTGCAAGT
TCTGACAAAA CCGCCCGCCT CTGGGATACT GAGAATGGCA ATGTATTAGC TACTCTCAAC
CACCAGTCCT CGGTAATAGC AGTAGCCTTT AGCCCCGACG GCAAAACCAT TGCTACTGCA
AGTTCTGACA AAACCGCCCG CCTCTGGGAT ACTGAGAATG GCAATGTATT AGCTACTCTC
AACCACCAGT CCTCGGTAAT AGCAGTAGCC TTTAGCCCCG ACGGCAAAAC CATTGCTACT
GCAAGTTCTG ACAAAACCGC CCGCCTCTGG GATACTGAGA ATGGCAAAGT ATTAGCTACT
CTCAACCACC AGTCCAGGGT AAATGCAGTA GCCTTTAGCC CCGACGGCAA AACCATTGCA
ACTGCAAGTG ATGACAAAAC CGCCCGCCTC TGGGATACTG AGAATGGCAA TGTATTAGCT
ACTCTCAACC ACCAGGACTG GGTATTTGCA GTAGCCTTTA GCCCCGACGG CAAAACCATT
GCAACTGCAA GTTCTGACAA AACCGCCCGC CTCTGGGATA CTGAGAATGG CAATGTATTA
GCTACTCTCA ACCACCAGGA CTGGGTATTT GCAGTAGCCT TTAGCCCCGA CGGCAAAACC
ATTGCTACTG CAAGTTCTGA CAATACCGCC CGCCTCCATT GGGCAACACC AGAAGGCTTA
ATTCAAGAAG GTTGTCGCCG CCTCAGTCGG AATTTAACAG CAGAGGAATG GCAGCAGTAT
ATCAACAGCG ACTTGGAGAC ATATCAGAAA ACTTGCAAAA ATATTCCCGT TCATCCTAGT
TTAATTGCAG AAGCTAAAAA TCTTGCAAAA ACAGGGGAAA AACCGAAAAT CAAACAGGCA
ATTTCTATCT TCAAAAAAGC TCTAGAATTG GAACCAGAAA TTGACCTCGA TCCTGATACA
AAAACTAGAG AAACAGACCC CCAACTTGTA GCAAATAAAC TTGCTGCTTC TGCCAAATTA
AAATAA
 
Protein sequence
MDKKNLRQAL VVGINRYPLL KKKKLGDLNL KAAVKDAEAI ANILEKYGKF RIQRLPSLPS 
LPKNYDQEGT ERFDPKGKVK INELQEAIIN LFKPRKKNET PDVALLFFAG HGYVDEKGDI
REGFLATSEA HLSENVYGIS LNWLKRLLQD SPVQEQIVWL DCCFSGEFLN FDREANPGTE
GKKISRCFIT ASRSFQTAEE KLDGKHGLFT DNLLAGLNPE NYVDGWVTNY VLAEFINKKM
SRTSQAPMFQ NSGDAIILTT NTLTEYKDER WKNLAPYRGL SYFRQQENDA VFFHGRTLLT
DELIDRVRTN NFVAVLGASG SGKSSLLRAG LLYQLRQGQK ISGSDRWRYL NPFTPTFSPL
KSLELAINRE GEKQENFTDN FTDNLIRFID LVEAERVVMI IDQFEEVFTL CQGDEEKEQE
RLDFFDCFLD VLERRGDKFC LVLGMRADFL DRCSEYGRLA NQIKRHQLLV TPLEKDEIDE
VIKKPAELVG VGVEPGLIAQ IREDFLRNPG SLPLLEYTLD ALWKFATQGE NKSQFLTLAT
YTKLGGIKGT LTKRADAVFQ SLNDEERSVA KRIFLELVQP GEKEISSGKI TDTRRRVILE
KLPNKRHSLE LLSAVSDRLA DPNNRLITKD NSEGGILLDI VHEDLIRSWK TLREWVEEYQ
EALPVERKIE ADAAEWKKDG KNEGLLLRAG RLTKAEEYLK KYDEMALLDG VAYEFIEASR
ELKIREEEKE KERQRKVEEQ AARILGMLSD SMIRQKPSLL DKGVLLGIES MKQYFDIKKR
YGKVDSDLLF ELDQTLRNGV SQLPKHLYTL KHQSDVYAVA FSPDGKTIAT ASYDKTARLW
DTENGKELAT LKHQSDVYAV AFSPDGKTIA TASSDKTARL WDTENGKELA TLNHQSSVNA
VAFSPDGKTI ATASSDKTAR LWDTENGNVL ATLNHQSSVN AVAFSPDGKT IATASSDKTA
RLWDTENGKE LATLNHQSSV NAVAFSPDGK TIATASSDKT ARLWDTENGK ELATLNHQSW
VNAVAFSPDG KTIATASSDK TARLWDTENG NVLATLNHQS SVNAVAFSPD GKTIATASSD
KTARLWDTEN GKELATLNHQ SSVNAVAFSP DGKTIATASS DKTARLWDTE NGKELATLNH
QDTVRAVAFS PDGKTIATAS SDKTARLWDT ENGNVLATLN HQSSVIAVAF SPDGKTIATA
SSDKTARLWD TENGNVLATL NHQSSVIAVA FSPDGKTIAT ASSDKTARLW DTENGKVLAT
LNHQSRVNAV AFSPDGKTIA TASDDKTARL WDTENGNVLA TLNHQDWVFA VAFSPDGKTI
ATASSDKTAR LWDTENGNVL ATLNHQDWVF AVAFSPDGKT IATASSDNTA RLHWATPEGL
IQEGCRRLSR NLTAEEWQQY INSDLETYQK TCKNIPVHPS LIAEAKNLAK TGEKPKIKQA
ISIFKKALEL EPEIDLDPDT KTRETDPQLV ANKLAASAKL K