Gene Tery_3517 details


Gene Information

Locus tag: Tery_3517
Symbol:
ID: 4244342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5420313
End bp: 5421527
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 33%
IMG OID: 638108491
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_723080
Protein GI: 113477019
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
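
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5420313
END_BP = 5421527
GENE_LENGTH_BP = 1215
PROTEIN_LENGTH_AA = 404


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 5421527 - 5420313 + 1 gives the stated 1215 bp, and 1215 / 3 - 1 gives the stated 404 aa.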


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTGT TAAATTTTTG TCGAATAAGT ACCAAAATTA AGGCTCAAAT ATTGTCCATT 
TTTATAGCTT TGAGTTTTAT ATTTGCTTGT TTCTCAGCAG TACCAGCAAT GGCAGATTCT
AAAGTATTTA ACGGGTTATC AAATACTCAA TTAGTAGCAT TACAAACTAT ACCAGAAAAT
ATCAAGACTA GAAATAGTAA TAGCTTTGTC ACAGAAGCAG TAAATAAAGT TGATTTAGCT
GTAGTTAGAA TTGACACAGA AAGACTAGTT ACCCGTCCCA ATAATAATTT TTTTGAAGAC
CCATTTTTTG ATCGTTTCTT TGATGAAAAC TTAAGGATTC AACCACCTTC AAAAGAATTG
TTAAGAGGTC AAGGTTCCGG TTTTATTGTT GACTCGAAAG GCATAATTTT AACCAATGCT
CATGTAGTCA ATAAAGCTGA CAAAGTTACT GTAACTTTAA ATGATGGTAG ACAATTTATT
GGGGAAGTAA AAGGAACAGA TGAAATTACA GATTTAGCAG TAGTTAAAGT TGATACAAAA
GATGAGATTT TACCAGTAGC AATTTTAGGT GATTCTAATT TAATACAAGT AGGAGATTGG
GCAATAGCAG TAGGAAATCC TCTAGGATTT AATAACACTG TTACTTTAGG AATTATTAGT
ACTTTAAAAC GTCCTAGTTC AGCAATAGGA ATTCCTGATA AGAGACTAGA TTTTATTCAA
ACTGACGCAG CAATTAACCC AGGAAATTCC GGGGGTCCGT TGTTGAATGA TAGGGGTGAA
GTAATTGGAA TTAATACTGC AATTAGAGCT GATGCTATGG GTATTGGTTT TGCTATTCCT
ATAAATAAAG CTAAAGAAAT TAAAGATATA TTAGTTCGTG GAGAACAAGT ACCTCATCCT
TTTATTGGCA TTCAGATGAT TACTCTAAAT CCAGAAATTG CTAAAGAAAA TAATAGTGAC
CCCAATTCTG TTTTAATTTT GCCAGAAGTA AAAGGAGTTT TAGTAACGAG AATATTGCCT
GGTACTCCAG CGGAAAAATC AGGGATGCGC ATAGGAGATG TAATTATAGA AATTGACAAT
CAATCAGTAT TTAGTGCTGA ACAGTTACAG AGAAAAGTTG AAAATAGTGG TGTAGGTGAA
AAATTGCTAT TCAAAGTTAT GCGAAATAAC AGAGAAAAAG AACTATTTGT TGTTAGCGGA
CAAATGAATT ATTAG
 
Protein sequence
MKLLNFCRIS TKIKAQILSI FIALSFIFAC FSAVPAMADS KVFNGLSNTQ LVALQTIPEN 
IKTRNSNSFV TEAVNKVDLA VVRIDTERLV TRPNNNFFED PFFDRFFDEN LRIQPPSKEL
LRGQGSGFIV DSKGIILTNA HVVNKADKVT VTLNDGRQFI GEVKGTDEIT DLAVVKVDTK
DEILPVAILG DSNLIQVGDW AIAVGNPLGF NNTVTLGIIS TLKRPSSAIG IPDKRLDFIQ
TDAAINPGNS GGPLLNDRGE VIGINTAIRA DAMGIGFAIP INKAKEIKDI LVRGEQVPHP
FIGIQMITLN PEIAKENNSD PNSVLILPEV KGVLVTRILP GTPAEKSGMR IGDVIIEIDN
QSVFSAEQLQ RKVENSGVGE KLLFKVMRNN REKELFVVSG QMNY
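
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the method used by the database: it relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard genetic code (the two differ only in which start codons are permitted), so a standard codon table is enough for a gene that starts with ATG. The helper names `clean`, `gc_content` and `translate` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the gene sequence listed above.
    print(translate("ATGAAATTGT TAAATTTTTG TCGAATAAGT"))  # MKLLNFCRIS
    # Last 15 nucleotides, ending in the TAG stop codon.
    print(translate("CAAATGAATT ATTAG"))  # QMNY
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the reported GC content; the two short fragments shown match the first ten and last four residues of the protein sequence.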