Gene Tery_2558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2558 
Symbol 
ID4244860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3953889 
End bp3954944 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content38% 
IMG OID638107632 
Productpeptidase M23B 
Protein accessionYP_722231 
Protein GI113476170 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGA AAAATTATTT TTATGGTTTT GTTACTGCAT TAGTAATAGC ATTAGTAACT 
AACTACCTGC TAAGTGCTAA TGCTCATACC CGACTATTTC AAAATTCTGT CTTTGAAAAT
TCTATTGCTC AGGTATCAAA TAATCCAAAA TTTGCTCTAC CTATTGAATG TAAGCTTGAC
AAAGACTGCT TTATTCTGTT GTATAGCGAT CGCGATCCTA GTCCGAAAGA ACTAGATTTT
GGTTGTGGCA GGCAAACTTA CGATGGTCAC AAAGGAACAG ATTTTGCTAT ACCTGATGAA
AAAATTATGG CACAAGGTGT AGCTGTGACA GCTGTGGCGC CTGGTAAAGT GCTACGAACC
AGAGACGGAA TACCAGATCG CCGAATAATA GATACAGCAG ATCGGGATGC GGTTAAGAAT
ATTGAATGTG GAAATGGAAT AGTCATAGAT CATGGTAATG GTTGGGAAGC TCAATACTGT
CATCTCCGTA ATGGTAGTGT TGTTGTGAAA CCAGGAACGG TTGTCAAAGC AGGTACTCAA
CTGGGAATAG TAGGAACATC TGGTTTATCT TCTTTTCCTC ATGTTCATTT AAGTGTTCGA
TATCAAGGAG AAATTGTAGA TCCATTTGTT GGAGTGAATG TTAAGAGTGG TTGTAATGTT
CCTCGTAATT CAATTTGGGA AGAACCGTTG AGTTATAAAC CAACTGGAAT TATTAGAAGT
GGGTTTGCTA CTGTTGCGCC AACTATGGAT GATTTGTGGT CTGGCAAATT TTACGATACT
GTTTTAGCAG GAAATAGTGC GGCATTAATA TTTTGGGTAC AAATTTATGG GGTTTTGCCT
GGGGATAAAG AACATTATCA ATTATTTGCT CCTAATGGTG AGGGGGTTAT AGACAATAAA
AAGGAAATGA AATCTGCTCA CAAGACATGG ATGGGGTATG TGGGTAAACG CAATAATTCT
CAGTCCCTTC CTATAGGTAA ATGGAGAGGT GAGTATAGTT TGACTAGAGG TGACCAAGTA
TTAGTAAATA TTACAAAAGA GGTTCAATTA AATTGA
 
Protein sequence
MKLKNYFYGF VTALVIALVT NYLLSANAHT RLFQNSVFEN SIAQVSNNPK FALPIECKLD 
KDCFILLYSD RDPSPKELDF GCGRQTYDGH KGTDFAIPDE KIMAQGVAVT AVAPGKVLRT
RDGIPDRRII DTADRDAVKN IECGNGIVID HGNGWEAQYC HLRNGSVVVK PGTVVKAGTQ
LGIVGTSGLS SFPHVHLSVR YQGEIVDPFV GVNVKSGCNV PRNSIWEEPL SYKPTGIIRS
GFATVAPTMD DLWSGKFYDT VLAGNSAALI FWVQIYGVLP GDKEHYQLFA PNGEGVIDNK
KEMKSAHKTW MGYVGKRNNS QSLPIGKWRG EYSLTRGDQV LVNITKEVQL N