Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2624 |
Symbol | |
ID | 4245349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4059509 |
End bp | 4061926 |
Gene Length | 2418 bp |
Protein Length | 805 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638107693 |
Product | peptidase C14, caspase catalytic subunit p20 |
Protein accession | YP_722292 |
Protein GI | 113476231 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.660296 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTAA AGCGGCGGGA ATTTCTGCAA AGGGCTAGTT TGGCCCTAGG AGTATTGGGA ATCAATCAAG CAGGGTGGTG GCGCCTCCAA AATTATTATT CAGAAGCCTT AGCTCAAACT ACAGGAGGTA AATTAGCATT CTTAGTAGGT ATTAATGAAT ATCTAAATGC ATCTCTGTCT GGGTGTGTTA CAGATGTGGA GATGCAACGG GAATTGCTAA TTCATCGGTT TGGTTTTTTA CCTGCTGATA TTCTGACTCT CACAAATGAA CAAGCTACTA GGGAAAATAT AGAAACAGCT TTTATTAGTC ATTTAACTGA TCAAGCTAAA CCAGATGACT TGGTTGTGTT TCATTTTAGT GGCTATGGTA GTCGTGTGAC TAAGATGATT GATCAAAATA AACAGAATGA ATTAAGTACC TCAAATTTAA TTTTTCAAAA TAGTCTAGTT CCAATAGATG GTATAGCTTC GAATAATGAG GGCGCAGAAA TAAATGATGT TTTAGAAGAG ACTCTCTGGT TATTATTGCG ATCGCTCCCT ACGAAAAAAG TAGTAACTAT ATTAGATACT AGCTATGTAT ATCCTGGTAA AAATTTGCAA GGTAATCTCA GAATTCGCTC TCGACCTAGT TTGACTACAG AGCAAATCAA TATGAAAGAG AAAGCAATGC AAATGCAACT GCGACATCAA CAAAATATTT TAGGAGAACA GGAAATTTAT GAAAGTCAAC TCAAAGGCTT AGTTCTTAAG GCTAGCCAAC AAAACCAATT TGCTACTGAA GCTGACTGGA ATGGTTTTAG CTCTGGTTTA TTTACTTATG CTCTTACCCA AAATTTATGG TCTACAGCCC CCGCCACCAG ACTACAAATT AGTTTTTCAC AAGCTGCTGG CGAGGTAGAA CAAATAACTG GAGTTAATCA GCAACCAGAG TTTAAACAAG CAAAACAACA GAACATTGCT CAATCTACAA AAGTACCTAC TATAATTCCA GTAGCTATCA GTAATCAAAA TATCTTTAAT CCTTTATTAC CCTCTGCTAC GGGTGTAATC ACTTCAGTAG AAGATAACGG CAAAACAGCA AACCTATGGT TAGGAGGATT ACCAGTTAGT ATTCTTGATA CGGTTACATC TTATTCTCTA TTCAAAACTT TTCCTGATTT TGATTTAGAC TTAAGCTTTG AATTACAAAT TCAGAATCGA AATGGCTTAA AAGCAAAAGC TTCCGTTATC ATTGACACTA ATAATTCACA ACAATTATCA GATATAAACC AGGAAAATAT CAACGACCAA ACACAAAAAA CTTCTCAAAA TCTTCTTCCT AATCTTCAAC TTTTGACAGG ATCTCTGGTA CAAGAAAAGA TTAGAGTTTT ACCTCAATCT CTGAATTTGA TGATAGCTCT AGACCATAAT ATGTCAAGAA TAGAGCGGGT AGATGCTACC AGTGGTTTTT CAGCAATACC TAAAGTTACC GTAGTAGCAG ATACACAAGC CGCTGATTAT ATATTTAGTC GAGTAGAAGA AACAACTATC GCTCAAAGTT CTTCTGCTAT TTTACCCTCT ACTTCTCAGG GTCATTATGC TCTATTTTCT CTGGGCAAGG TACTTATACC TAAGACTATA GGAGAAAAAG GAGAAGCAGT CAAGGGAGCA ATAGAAAGGT TACGTTCCCC ATTGGAAACC TTACTGGCAG CTAAGATGCT ACGCTTAACT AAGAATGAAG GATCTACTCA AATCAAACTT AGAGCTAACT TGGCTACTGT TACTCCAAAT GCAAGGTTAA TTCTTAAGCG AGAAACTTTG GGGGTTAGGG ATGGAACCAC AAAATTTTTA ACTGATATCA ATGAAGATAT AGACAAATCT GAAATTAGTA ATTCATTAGG AACTTTACCC ATTGGTAGTC GAATTCAATA TCAATTATAT AATTATGGCG ATCGCCCAGT CTATTTTATG TTAGTCATTT TAGACAGTAG TGGGCGCTTT TTTGTTGTCG ATCCAACTAT ATCTAATCAA CTTGAGAATG ATCCTCAACT TTCACTAACA GAGTTAATTG TTTCTCCGGG AGATAGTGTA AATATACCTC CAGTTATTAA TAATTCTTCT AAAAATACAA AATCTTTGGG ATGGAGGGTA ATAGGTTCTG AAGGATTAGA AGAAAGCCTA ATTATTTGTA GCCATCAACC ATTTAAGAAA ACAATTGCAA CTCTTGTTGA TGGAAAGGTG TTTCAAATTA GAGATGATCA GCTAATTAAA GAAATATTGA ATCCTTTAGC AGTCGCACAA GCAACTTTAC AAGATTTACA GAGTGCTAGT CAATTAGCTA CTCAATCTTT AGAGTTATCA ACGGATAATT ATGCTCTCGA TATTAATAGT TGGGCAACTC TGAGTTTCGT TTATCGAGTT GTTAAAAAAA CACTTTGA
|
Protein sequence | MGLKRREFLQ RASLALGVLG INQAGWWRLQ NYYSEALAQT TGGKLAFLVG INEYLNASLS GCVTDVEMQR ELLIHRFGFL PADILTLTNE QATRENIETA FISHLTDQAK PDDLVVFHFS GYGSRVTKMI DQNKQNELST SNLIFQNSLV PIDGIASNNE GAEINDVLEE TLWLLLRSLP TKKVVTILDT SYVYPGKNLQ GNLRIRSRPS LTTEQINMKE KAMQMQLRHQ QNILGEQEIY ESQLKGLVLK ASQQNQFATE ADWNGFSSGL FTYALTQNLW STAPATRLQI SFSQAAGEVE QITGVNQQPE FKQAKQQNIA QSTKVPTIIP VAISNQNIFN PLLPSATGVI TSVEDNGKTA NLWLGGLPVS ILDTVTSYSL FKTFPDFDLD LSFELQIQNR NGLKAKASVI IDTNNSQQLS DINQENINDQ TQKTSQNLLP NLQLLTGSLV QEKIRVLPQS LNLMIALDHN MSRIERVDAT SGFSAIPKVT VVADTQAADY IFSRVEETTI AQSSSAILPS TSQGHYALFS LGKVLIPKTI GEKGEAVKGA IERLRSPLET LLAAKMLRLT KNEGSTQIKL RANLATVTPN ARLILKRETL GVRDGTTKFL TDINEDIDKS EISNSLGTLP IGSRIQYQLY NYGDRPVYFM LVILDSSGRF FVVDPTISNQ LENDPQLSLT ELIVSPGDSV NIPPVINNSS KNTKSLGWRV IGSEGLEESL IICSHQPFKK TIATLVDGKV FQIRDDQLIK EILNPLAVAQ ATLQDLQSAS QLATQSLELS TDNYALDINS WATLSFVYRV VKKTL
|
| |