Gene Tery_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0022 
Symbol 
ID4241829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp36830 
End bp38158 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content28% 
IMG OID638105400 
Producttetratricopeptide TPR_2 
Protein accessionYP_720019 
Protein GI113473958 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTCATA CTAATCAAAC TGAAAGAAAG TACTATTCCC ATTATTCTGA TGATCAAATT 
AGGCAAATAG TGGTAGAAAA ATATAAAACT TGTCTAGAAA AGTTTGGGTA TTTATTTGAA
TTAGAAATTA AATATAAATT TGATAAAAAT TATCAGATAA AAATATCTAT TCCCATTTAT
AAAAAGTTTA TGGTTCCAGT GATTAATAAT TTCATTTCAA AAGCTAATAA ATTAAAAAAA
GAGGAGAAGC TAGATGAAGC GATCGCTCTT TACAAACAAG CTATAGAATT AAATCCCGGT
TTTCCTTGGT ATTACTATGA ACTAGGAGAA ACTTTAACAG AAAAAGGAGA TTTAGATGAA
GCAATAGCTC AGTTTTTTCA AGCTACTAAA ATTAATCCTA ACTCTGCCTG GTTTTGTTAT
GAACTGGCAA AAGCTTTACA AAAAAAAGGT GAGTTAAATC AGGCAATTAA TTATTATAGA
CAAGCTATTG AAAGTAACCC AGACTTTTAT GGATTTTACA ATAGTCTAGG TTGTTGCTTT
TTCCAGAATA AAAATTATCA TCAAGCTATA ATTAATTTTT TAATCGCACT GGAATTAAAC
TCCAGTTCAG TAATTAGCTA CGAATATTTA TATCAAGTAT ACTTACAACT TGGTTTGACT
AAAAAATGTA TTGATAATTT GTTTGGTAGA TTTAAACAAC CTAGTCAAAA ACCTACTTTA
ACTATTCGCA TAAAAGAAAA ATCTGTCAAT TCAAAACAGG AATTATCGAA AACAAAAATA
TTTGGAATTG GTTTAGGAAA AACAGGTACC ACAACTTTGG GAGCTTGCTT AAATAATCTT
GGATATAAAC ACTATGGTTG GTATACTTTG ACTAATCACA AACTGCTTTA CCAAATTAAA
TTAGATAATT TTGATGATGT TCATAGAGTA GTCAATCAAT ATGATTGTTT TGAAGATTAT
CCGTGGCCTT TGATTTACAA ATGGCTCGAT CAAAAATACC CTAATAGTAA ATTTATATTA
ACCATAAGGA AATGTTCTCA AACTTGGTTT AAAAGTTGTT TAAATCATTA TTTTCGACTA
CAAGATAGAG CAGCTTATCT CTATGATTTA ATCTATGGTT TAGGTCATCC ACAGAATTGT
TCAGATGAGT ATATAAATTT TTACGAAAGT CATAATCAAC AAGTTATTGA TTACTTTCAA
AGTAAACCGG ATAAATTACT GATAATTTGT TGGGAAAAAG GAGATAGTTG GGATAAATTA
TGTGGTTTTC TCGGAAAAAC AGTTCCTGAA ATTCTTTTGC CTCATTTGAT GAAAAGTCAA
ACTAATTAA
 
Protein sequence
MPHTNQTERK YYSHYSDDQI RQIVVEKYKT CLEKFGYLFE LEIKYKFDKN YQIKISIPIY 
KKFMVPVINN FISKANKLKK EEKLDEAIAL YKQAIELNPG FPWYYYELGE TLTEKGDLDE
AIAQFFQATK INPNSAWFCY ELAKALQKKG ELNQAINYYR QAIESNPDFY GFYNSLGCCF
FQNKNYHQAI INFLIALELN SSSVISYEYL YQVYLQLGLT KKCIDNLFGR FKQPSQKPTL
TIRIKEKSVN SKQELSKTKI FGIGLGKTGT TTLGACLNNL GYKHYGWYTL TNHKLLYQIK
LDNFDDVHRV VNQYDCFEDY PWPLIYKWLD QKYPNSKFIL TIRKCSQTWF KSCLNHYFRL
QDRAAYLYDL IYGLGHPQNC SDEYINFYES HNQQVIDYFQ SKPDKLLIIC WEKGDSWDKL
CGFLGKTVPE ILLPHLMKSQ TN