Gene Tery_3710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3710 
Symbol 
ID4243886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5699408 
End bp5700592 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content34% 
IMG OID638108656 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_723242 
Protein GI113477181 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTCA AATTTTTGCA AAAAGCTACA TCAAGAATAT TAGCACTAGG GATAGGAGTA 
AGTGTAAATT TCACTAACTT TTCTGTTGTC AATGCTCAAA TAAGAGCAGA TAAAACTATA
AAAATTCGGT ACAAAAACCA AATAGAATCA ATATTTGTTC AGAGTATAGA AGAACAAACT
AATATTCGTG TTTATCAACA AACAAATCAA GCAGTAGTCT CCATAGATAC AGATAAAGCA
AACGGTAGTG GTACGATTAT TTCCCGCGAT GGAATGGTAC TAACTAATGC TCATGTCGTG
TCCCAAGGTG GTATTGTAAA AATCACTTTA GCAGATGGAA GAAAAGTTGA AGCAGATGTC
ATAGGTTTTG GTGAAAAAGG TTTAGACTTA GCAGTTCTAA AAATTAGAGG AGAAACAAAT
CTTCCTACCA TTAGGATAGC TAGCTCTGGT GATATAAAAG TAGGTCAGCG AGCTTTTGCT
ATAGGTAATC CTTTTGGTAG ATTTCAAGGT ACTCTTACTG TAGGAATAGT TAGTCGTATA
GATGAAGAAA GAGGTTTGAT TCAAACAGAT GCAGCTATTA ATCCTGGTAA CTCTGGGGGA
CCATTGTTAA ATAGTGCTGG AGAACTCATC GGTGTTAATA CTGCTATACT TACTCCTAGT
CAATTAGGGG GTAATATTGG TATTGGCTTT GCTATATCTA TAGATAAAGT CCCAGAGTTG
CTCAAGGCAG TTCGAGAAAG ACGAGCACCT TTAGTCGCTC AACATTCAAG AAGTAGGATG
TTTGATGAGG ACTTTGCTAA AAAATTAGAC TTTGATGCTT TAATTGAAGT TCAAGGCAAT
TTAGATGGAG AATCTAATGT ATTACCTGTA GATAATAGTT ACTATGATCT TTATGCTTTT
AAAGGTAGAG CAGGTCAAAA AATTTCTATT GATATGAGTA GTAATCAAAT AGATTCTTAC
TTGATTTTAT TAAACTCAGA AGATCAAGAA TTAGCTCAAG ATGATGATAG TGGGGAAGAC
AAAAATGCTA GGATTATAAT TATATTACCA AAGGATGGAA CTTATAAGTT ATTAGCAAAT
TCTTATGAAG CAGGAGAGTC AGGAAAGTAT GAGTTGAAAA TTGAAGCAAT TTCACCTAAA
ATGAGAATTT TATTTGAGCA AGAAAAGAGT AACTTTTATA GATAA
 
Protein sequence
MKVKFLQKAT SRILALGIGV SVNFTNFSVV NAQIRADKTI KIRYKNQIES IFVQSIEEQT 
NIRVYQQTNQ AVVSIDTDKA NGSGTIISRD GMVLTNAHVV SQGGIVKITL ADGRKVEADV
IGFGEKGLDL AVLKIRGETN LPTIRIASSG DIKVGQRAFA IGNPFGRFQG TLTVGIVSRI
DEERGLIQTD AAINPGNSGG PLLNSAGELI GVNTAILTPS QLGGNIGIGF AISIDKVPEL
LKAVRERRAP LVAQHSRSRM FDEDFAKKLD FDALIEVQGN LDGESNVLPV DNSYYDLYAF
KGRAGQKISI DMSSNQIDSY LILLNSEDQE LAQDDDSGED KNARIIIILP KDGTYKLLAN
SYEAGESGKY ELKIEAISPK MRILFEQEKS NFYR