Gene Tery_2994 details

Gene Information

Locus tag: Tery_2994
Symbol: secY
ID: 4245110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4647560
End bp: 4648900
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 38%
IMG OID: 638108029
Product: preprotein translocase subunit SecY
Protein accession: YP_722622
Protein GI: 113476561
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0201] Preprotein translocase subunit SecY
TIGRFAM ID: [TIGR00967] preprotein translocase, SecY subunit
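
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4647560
end_bp = 4648900
gene_length_bp = 1341
protein_length_aa = 446

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Each codon is three bases long.
codons = span_bp // 3

# Assumption: the final codon is a stop codon and encodes no residue.
residues = codons - 1

print(span_bp, codons, residues)
```

Under these assumptions the span, the codon count, and the residue count agree with the Gene Length and Protein Length fields.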


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.389258
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0673077
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATTA GTAGAGATAA GGCACCAACT GCTCAAGAGA CATTCTTGCA GATGGCCCAA 
GCAGCAGGAT TGCGAGGGCG ACTGCTAGTA ACAATAGGTT TACTTCTATT AGTGCGTTTA
GGTGTTTATC TACCAGTACC AGGAATAAAT AGAGAAGCTT TTCAAGCTAG AATTGGAGAC
CAGGCTTTAA CAGGCTTCCT AGACCTTTTT TCTGGTGGTG GTTTTTCAGC ATTAGGTATT
TTTGCATTAG GGATTATACC TTACATTAAC GCATCAATTA TTATTCAATT GATGACGGCA
GCGCTACCAA GTTTAGAAAA CTTGCAAAAA AATGAGGGTG AGGCAGGACG TCGAAAGATT
TCTCAAATTA CTCGCTATGT AGCACTGGGT TGGTCGGTTT TACAAAGCTT TGGTTTAGCT
ATATATCTTA ATAGCTCATC TGTAGATATT AATGGAGAAC TGGTAACTGT ATCAATTAAT
CCAGGATCTC TATTTATTGC TAAAACTATT TTGGCAATTA CAGCAGGTTC AATGTTTGTG
ATGTGGATAT CAGAACTAAT TACAGAAAGA GGTATTGGAA ATGGTGCATC CTTACTTATT
TTTGTAAATA TTGTTGCTGT ACTGCCTCAG TCATTAGGTG ACACTATTAA ACTTTTTGAA
GGAGGAGACC GAGCAATAGT TGGTCGGGCA ATTATTTTGC TGCTGGTATT CCTCGTAATG
ATAGTAGGTA TTGTATTTGT ACAAGAGGGC AGTCGCAGGA TACCTATACT TTCAGCTCGT
CGTCAAGTGG GTAGAAAATT ATATAGGGAA ACAAAAAGCT ATCTACCTTT ACGGTTGAAC
CAAGGAGGGG TAATGCCAAT TATTTTTGCC TCAGCAGTTC TGATATTACC TGTATCCTTA
GCTCAGTTTG CGAATAGCCC TATTATATCT CAGATTGCTA CTGCTATAAG TCCTAGTGGT
CCTACACCCT GGCTATATGC TTTATTTTAT TTCGTTTTGA TCCTTTTCTT CAGTTACTTC
TACGCTTCTC TAATTATGAA CCCAGTGGAT ATGTCCCAAA ATCTGAAGAA AATGGGGGCA
AGTATTCCAG GTATTCGTCC AGGTAAAACG ACCAGTGACT ATATAGAGAA AGTTTTGAAT
AGGTTGACTT TCCTAGGAGC TATTTTTCTG GGTTTAGTAG CTATAGTTCC TACTATAGTA
GAAAGTGCTA CTCGTGTACC TACATTTAGA GGATTAGGAG CTACTTCTCT ATTAATTCTA
GTAGGTGTAG CAATAGATAC AGCCAAACAA ATTCAAACTT ATGTTATCTC TCAAAGATAC
GAAGGAATGG TTAAGCAATA G
 
Protein sequence
MDISRDKAPT AQETFLQMAQ AAGLRGRLLV TIGLLLLVRL GVYLPVPGIN REAFQARIGD 
QALTGFLDLF SGGGFSALGI FALGIIPYIN ASIIIQLMTA ALPSLENLQK NEGEAGRRKI
SQITRYVALG WSVLQSFGLA IYLNSSSVDI NGELVTVSIN PGSLFIAKTI LAITAGSMFV
MWISELITER GIGNGASLLI FVNIVAVLPQ SLGDTIKLFE GGDRAIVGRA IILLLVFLVM
IVGIVFVQEG SRRIPILSAR RQVGRKLYRE TKSYLPLRLN QGGVMPIIFA SAVLILPVSL
AQFANSPIIS QIATAISPSG PTPWLYALFY FVLILFFSYF YASLIMNPVD MSQNLKKMGA
SIPGIRPGKT TSDYIEKVLN RLTFLGAIFL GLVAIVPTIV ESATRVPTFR GLGATSLLIL
VGVAIDTAKQ IQTYVISQRY EGMVKQ
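
Both sequences above are printed in blocks of ten characters separated by spaces, so the whitespace has to be removed before the sequences are used elsewhere. The following is a minimal sketch of that step; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a blocked listing."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty DNA string."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGGACATTA GTAGAGATAA GGCACCAACT GCTCAAGAGA CATTCTTGCA GATGGCCCAA "

dna = join_blocks(first_line)
print(len(dna), dna[:3], round(gc_fraction(dna), 2))
```

Passing the full multi-line gene sequence to `join_blocks` works the same way, since `str.split()` with no arguments splits on any run of whitespace.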