Gene Tery_2624 details


Gene Information

Locus tag: Tery_2624
Symbol:
ID: 4245349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4059509
End bp: 4061926
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 36%
IMG OID: 638107693
Product: peptidase C14, caspase catalytic subunit p20
Protein accession: YP_722292
Protein GI: 113476231
COG category:
COG ID:
TIGRFAM ID:
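
The coordinates and lengths listed above are consistent with each other if one assumes inclusive start/end coordinates and a stop codon that counts toward the gene length but not the protein length. The following is a minimal sketch of that arithmetic; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop.

    Assumption: the CDS is complete and in frame, so its length is a
    multiple of three and exactly one codon (the stop) is untranslated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4059509, 4061926)
print(length)                  # 2418
print(protein_length(length))  # 805
```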


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.660296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTAA AGCGGCGGGA ATTTCTGCAA AGGGCTAGTT TGGCCCTAGG AGTATTGGGA 
ATCAATCAAG CAGGGTGGTG GCGCCTCCAA AATTATTATT CAGAAGCCTT AGCTCAAACT
ACAGGAGGTA AATTAGCATT CTTAGTAGGT ATTAATGAAT ATCTAAATGC ATCTCTGTCT
GGGTGTGTTA CAGATGTGGA GATGCAACGG GAATTGCTAA TTCATCGGTT TGGTTTTTTA
CCTGCTGATA TTCTGACTCT CACAAATGAA CAAGCTACTA GGGAAAATAT AGAAACAGCT
TTTATTAGTC ATTTAACTGA TCAAGCTAAA CCAGATGACT TGGTTGTGTT TCATTTTAGT
GGCTATGGTA GTCGTGTGAC TAAGATGATT GATCAAAATA AACAGAATGA ATTAAGTACC
TCAAATTTAA TTTTTCAAAA TAGTCTAGTT CCAATAGATG GTATAGCTTC GAATAATGAG
GGCGCAGAAA TAAATGATGT TTTAGAAGAG ACTCTCTGGT TATTATTGCG ATCGCTCCCT
ACGAAAAAAG TAGTAACTAT ATTAGATACT AGCTATGTAT ATCCTGGTAA AAATTTGCAA
GGTAATCTCA GAATTCGCTC TCGACCTAGT TTGACTACAG AGCAAATCAA TATGAAAGAG
AAAGCAATGC AAATGCAACT GCGACATCAA CAAAATATTT TAGGAGAACA GGAAATTTAT
GAAAGTCAAC TCAAAGGCTT AGTTCTTAAG GCTAGCCAAC AAAACCAATT TGCTACTGAA
GCTGACTGGA ATGGTTTTAG CTCTGGTTTA TTTACTTATG CTCTTACCCA AAATTTATGG
TCTACAGCCC CCGCCACCAG ACTACAAATT AGTTTTTCAC AAGCTGCTGG CGAGGTAGAA
CAAATAACTG GAGTTAATCA GCAACCAGAG TTTAAACAAG CAAAACAACA GAACATTGCT
CAATCTACAA AAGTACCTAC TATAATTCCA GTAGCTATCA GTAATCAAAA TATCTTTAAT
CCTTTATTAC CCTCTGCTAC GGGTGTAATC ACTTCAGTAG AAGATAACGG CAAAACAGCA
AACCTATGGT TAGGAGGATT ACCAGTTAGT ATTCTTGATA CGGTTACATC TTATTCTCTA
TTCAAAACTT TTCCTGATTT TGATTTAGAC TTAAGCTTTG AATTACAAAT TCAGAATCGA
AATGGCTTAA AAGCAAAAGC TTCCGTTATC ATTGACACTA ATAATTCACA ACAATTATCA
GATATAAACC AGGAAAATAT CAACGACCAA ACACAAAAAA CTTCTCAAAA TCTTCTTCCT
AATCTTCAAC TTTTGACAGG ATCTCTGGTA CAAGAAAAGA TTAGAGTTTT ACCTCAATCT
CTGAATTTGA TGATAGCTCT AGACCATAAT ATGTCAAGAA TAGAGCGGGT AGATGCTACC
AGTGGTTTTT CAGCAATACC TAAAGTTACC GTAGTAGCAG ATACACAAGC CGCTGATTAT
ATATTTAGTC GAGTAGAAGA AACAACTATC GCTCAAAGTT CTTCTGCTAT TTTACCCTCT
ACTTCTCAGG GTCATTATGC TCTATTTTCT CTGGGCAAGG TACTTATACC TAAGACTATA
GGAGAAAAAG GAGAAGCAGT CAAGGGAGCA ATAGAAAGGT TACGTTCCCC ATTGGAAACC
TTACTGGCAG CTAAGATGCT ACGCTTAACT AAGAATGAAG GATCTACTCA AATCAAACTT
AGAGCTAACT TGGCTACTGT TACTCCAAAT GCAAGGTTAA TTCTTAAGCG AGAAACTTTG
GGGGTTAGGG ATGGAACCAC AAAATTTTTA ACTGATATCA ATGAAGATAT AGACAAATCT
GAAATTAGTA ATTCATTAGG AACTTTACCC ATTGGTAGTC GAATTCAATA TCAATTATAT
AATTATGGCG ATCGCCCAGT CTATTTTATG TTAGTCATTT TAGACAGTAG TGGGCGCTTT
TTTGTTGTCG ATCCAACTAT ATCTAATCAA CTTGAGAATG ATCCTCAACT TTCACTAACA
GAGTTAATTG TTTCTCCGGG AGATAGTGTA AATATACCTC CAGTTATTAA TAATTCTTCT
AAAAATACAA AATCTTTGGG ATGGAGGGTA ATAGGTTCTG AAGGATTAGA AGAAAGCCTA
ATTATTTGTA GCCATCAACC ATTTAAGAAA ACAATTGCAA CTCTTGTTGA TGGAAAGGTG
TTTCAAATTA GAGATGATCA GCTAATTAAA GAAATATTGA ATCCTTTAGC AGTCGCACAA
GCAACTTTAC AAGATTTACA GAGTGCTAGT CAATTAGCTA CTCAATCTTT AGAGTTATCA
ACGGATAATT ATGCTCTCGA TATTAATAGT TGGGCAACTC TGAGTTTCGT TTATCGAGTT
GTTAAAAAAA CACTTTGA
 
Protein sequence
MGLKRREFLQ RASLALGVLG INQAGWWRLQ NYYSEALAQT TGGKLAFLVG INEYLNASLS 
GCVTDVEMQR ELLIHRFGFL PADILTLTNE QATRENIETA FISHLTDQAK PDDLVVFHFS
GYGSRVTKMI DQNKQNELST SNLIFQNSLV PIDGIASNNE GAEINDVLEE TLWLLLRSLP
TKKVVTILDT SYVYPGKNLQ GNLRIRSRPS LTTEQINMKE KAMQMQLRHQ QNILGEQEIY
ESQLKGLVLK ASQQNQFATE ADWNGFSSGL FTYALTQNLW STAPATRLQI SFSQAAGEVE
QITGVNQQPE FKQAKQQNIA QSTKVPTIIP VAISNQNIFN PLLPSATGVI TSVEDNGKTA
NLWLGGLPVS ILDTVTSYSL FKTFPDFDLD LSFELQIQNR NGLKAKASVI IDTNNSQQLS
DINQENINDQ TQKTSQNLLP NLQLLTGSLV QEKIRVLPQS LNLMIALDHN MSRIERVDAT
SGFSAIPKVT VVADTQAADY IFSRVEETTI AQSSSAILPS TSQGHYALFS LGKVLIPKTI
GEKGEAVKGA IERLRSPLET LLAAKMLRLT KNEGSTQIKL RANLATVTPN ARLILKRETL
GVRDGTTKFL TDINEDIDKS EISNSLGTLP IGSRIQYQLY NYGDRPVYFM LVILDSSGRF
FVVDPTISNQ LENDPQLSLT ELIVSPGDSV NIPPVINNSS KNTKSLGWRV IGSEGLEESL
IICSHQPFKK TIATLVDGKV FQIRDDQLIK EILNPLAVAQ ATLQDLQSAS QLATQSLELS
TDNYALDINS WATLSFVYRV VKKTL
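
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip that layout, compute a GC fraction, and check for a terminal stop codon. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Last line of the gene sequence listing, used here as a short sample.
tail = clean_sequence("GTTAAAAAAA CACTTTGA")
print(len(tail))             # 18
print(ends_with_stop(tail))  # True
```

Passing the full gene sequence listing through `clean_sequence` and then `gc_fraction` would allow a comparison with the GC content reported in the gene information.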