Gene Tery_4572 details


Gene Information

Locus tag: Tery_4572
Symbol:
ID: 4246226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7033113
End bp: 7036100
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 42%
IMG OID: 638109445
Product: peptidase M23B
Protein accession: YP_724021
Protein GI: 113477960
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
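
The coordinate and length fields above are consistent with one another, which a little arithmetic confirms. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(7033113, 7036100)
print(length)                     # 2988
print(protein_length_aa(length))  # 995
```

Both results match the Gene Length and Protein Length fields listed for Tery_4572.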


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.109216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGATA TTTTAGGAGA CTTTAGCAAC TTTGAACAAG ACTTGAGCAA CCTAATATTA 
ATTGGTGATA AGCAAGAAAA TTTCGATGAT GGGCAGAGAT GGACTAGTGG CATTGAGCCA
ACTGGGAATA ATTCTAAACT AAAAGTCGAA GAATTCAATA CAGGTGGCGA TGAACCATCA
CAGCCCCTTC AGGGACAAAA ATTCTTTGAT TTAGGTGAAC TTAACCATTT ATCTGCCTCA
AATTCTCCAC CGGGGAAGAA AGACCCACTA GTAGGTGAGG ACAATGAAGC AGTTGTCAAA
AAAAGTGACA ATTTAATAAA TCCAAATCCA ATTAACCGTC GAAGCGGTAA CAGTAGGAAT
AGGGCTGATA ATATTGGAAC TCTCAGCAGC AGTAGTAGTT TCACTGGTTT TGTTGGAACA
ACAGATACCA ACGACTATTA TCGCTTTTAT CTGAGTGGGG AAAGGGAGTT TAACCTCACT
CTCAACGGCT TAAGCGGTGA TGCGGACGTA CGATTACTCA ATAGTAGTGG TGGTACTATT
AGTAGTTCTA CCAAGGGTGG TAGTAGTTCC GAGAGCATCA GTGAAACTCT CAATTCGGGT
ACATATTATA TTAGGGTTTA TCCAATGAGT GGGGTGAATA CTAATTACAA TCTCAATATA
GAAGCAACTT CATCTTCATC TGATGAAGAG GTAAATATTA CTTCCCCAAG CAGCAGGACC
AGTATTGAAC CGGGAGAAAG GTATAATATT CGCTGGACCG ACAACTTTAG GGATAATGTC
AAACTGGAAT TGTACAAAGG AAGTTCCCGG CAACAAACAA TTGCTCGTTC CACCTCCAGC
GATGGCAGTT ACTCTTGGAG GGCGCCCACA TCTTTAAGCA GCGGTACTAA TTACAGAATC
AAGATTCGCA ATGTGAACGA TAGTAGTGTT TACGACTACA GTAGTTATTT CACTATTGAA
CCAGATGAAC CAGATGGAGT GGTAAATATT ACTTCCCCAA GCAGCAGTAC CAGTATTGAA
CCGGGAGAAA GGTATAATAT TCGCTGGACC GACAACTTTA GGGATAATGT CAAACTGGAA
TTGTACAAAG GAAGTTCCCG GCAACAAACA ATTGCTCGTT CCACCTCCAG CGATGGCAGT
TACTCTTGGA GGGCGCCCAC ATCTTTAAGC AGCGGTACTA ATTACAGAAT CAAGATTCGC
AATGTGAACG ATAGTAGTGT TTACGACTAC AGTAGTTATT TCACTATTGA ACCAGATGAA
CCAGATGAAA AGGTAAATAT CACTTCCCCA AGCAGCAGTA CCAGTATTGA ACCGGGAGAA
AGGTATAATA TTCGCTGGAC CGACAACTTT AGGGATAATG TCAAACTGGA ATTGTACAAA
GGAAGTTCCC GGCAACGAAC AATTGCTCGT TCCACCTCCA GCGATGGCAG TTACTCTTGG
AGGGCGCCCA CATCTTTAAG CAGCGGTACT AATTACAGAA TCAAGATTCG CAATGTGAAC
GATAGTAGTG TTTACGACTA TAGTAGTTAT TTCACTATTA AACCAGATGA AGAGGTAAAT
ATTACTTCCC CAAGCAGCAG TACCAGTATT GAACCGGGAG AAAGCTATAC TATTCGCTGG
ACCGACAACT TCAGCGATAA TGTCAAACTG GACTTGTACA AAGGCAGTTC CTGGCAACAA
ACAATTGCTA GTTCCACCTC CAGCGATGGC AGTTACTCTT GGAGGGTGCC TACATCTTTA
AGCAGCGGTA CTAATTACAA TATCAAAATT CGCAATGTGA ACGATAGTAG TGTTGACGAC
TACAGTAATA GTTTCACTAT CCAAAGCACA ACATGGCCTC CAAGCGTTAC CAAGGATTTG
AAAACCTATA CTGGTCGAGA AGAGTACAGT GGCTATGTAG GCAACGATGA CTACTACAAA
TTTTCTGTTG ACTCCCCTGG ATACCTACAA TTTGCACTGC GGGGAATGAG TGCAGATGCT
AACTTGCAAT TGTTGAATTC TAGCGGTAAA GTCTTAGAAA GTTCCAGCAA ATCAGGCAAT
AGTGATGAAT ATGCAAATGA AAACCTCGGC ATTGGCACTT ATTATATGCG GGTATACGGC
CATAATGGTG CCGATACCAA CTATCGCTTA GTACTGAACC TCGACAAAGC AAAAAATGAC
AGGAGTAATG CTCGTTGGCT CGGTGAGCTA GCAGGACAGC GTAAAGAATA CAAGGACTTT
ATAGGAACCA GCTCAGGGGA TCAGTATGAT TACTATAAAT TTACTGTACA AGAACCTCGT
TTCTTGGAAT ATGCGCTTCG AGACTTGACA GCTCCTGCTG ACATCGATAT ACTCAATTCC
AGTGGCGCTC GAATTACTCC TAAAGAAAAT GATAAAGATA ATCATCGATA TAACCTGCAC
GAAACTATGG GGTTACAAGC TGGTACGTAT TACGCGAGAG TTACGGCACC AACTGACTCT
AGTCAGCAAA CTAACTATAA ATTAGTGCTG AATCTTAAAG GTGAGTATAA GCCCTTACAC
TCTAACCCAA CAGAATCTAA CCCCCTGAAG GGATTCCAAT CTCCTGTCAG AGGTGAGAGA
TGGTATGTTT CACAGTCACC TGGTGGCAGT TATAGTCATA CTGGCAATTT GCGTTACGCT
ATAGATATCA GTATTCCAGG CTGGGATGAT TTTGGTGAAC CAATATATGC CATGCGTTCA
GGAACTGTTA AAAAAGTGGT CGATGATCAT CCAGATATCG CCGATTCTAA GCGAAATAAC
TTAGTGGAAA TCCAACACGA AAATGGCTAT GTTGCTAGGT ATTGGCATCT TCAGCAATAT
TCTAATTCAG ATGCAGGGTT ACAAGTCGGT CAAAAGGTGG ATGCAGGTCA AATGATAGGT
CGAGTAGGAA ACTCTGGTTT CAGTACTGGT CCTCATTTGC ATGTCGATGT TGTAGATTCT
AGCTTGATAA CAAGGCCATT TGAGATAGAA GGAATTTTTG ATTACTAA
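
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below is one minimal way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first displayed line is used as sample input, so the printed fraction will not equal the whole-gene value of 42%.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence only (sample input).
first_line = "GTGAGTGATA TTTTAGGAGA CTTTAGCAAC TTTGAACAAG ACTTGAGCAA CCTAATATTA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full block of sequence lines to `clean_sequence` should yield a 2988-base string if the record is intact.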
 
Protein sequence
MSDILGDFSN FEQDLSNLIL IGDKQENFDD GQRWTSGIEP TGNNSKLKVE EFNTGGDEPS 
QPLQGQKFFD LGELNHLSAS NSPPGKKDPL VGEDNEAVVK KSDNLINPNP INRRSGNSRN
RADNIGTLSS SSSFTGFVGT TDTNDYYRFY LSGEREFNLT LNGLSGDADV RLLNSSGGTI
SSSTKGGSSS ESISETLNSG TYYIRVYPMS GVNTNYNLNI EATSSSSDEE VNITSPSSRT
SIEPGERYNI RWTDNFRDNV KLELYKGSSR QQTIARSTSS DGSYSWRAPT SLSSGTNYRI
KIRNVNDSSV YDYSSYFTIE PDEPDGVVNI TSPSSSTSIE PGERYNIRWT DNFRDNVKLE
LYKGSSRQQT IARSTSSDGS YSWRAPTSLS SGTNYRIKIR NVNDSSVYDY SSYFTIEPDE
PDEKVNITSP SSSTSIEPGE RYNIRWTDNF RDNVKLELYK GSSRQRTIAR STSSDGSYSW
RAPTSLSSGT NYRIKIRNVN DSSVYDYSSY FTIKPDEEVN ITSPSSSTSI EPGESYTIRW
TDNFSDNVKL DLYKGSSWQQ TIASSTSSDG SYSWRVPTSL SSGTNYNIKI RNVNDSSVDD
YSNSFTIQST TWPPSVTKDL KTYTGREEYS GYVGNDDYYK FSVDSPGYLQ FALRGMSADA
NLQLLNSSGK VLESSSKSGN SDEYANENLG IGTYYMRVYG HNGADTNYRL VLNLDKAKND
RSNARWLGEL AGQRKEYKDF IGTSSGDQYD YYKFTVQEPR FLEYALRDLT APADIDILNS
SGARITPKEN DKDNHRYNLH ETMGLQAGTY YARVTAPTDS SQQTNYKLVL NLKGEYKPLH
SNPTESNPLK GFQSPVRGER WYVSQSPGGS YSHTGNLRYA IDISIPGWDD FGEPIYAMRS
GTVKKVVDDH PDIADSKRNN LVEIQHENGY VARYWHLQQY SNSDAGLQVG QKVDAGQMIG
RVGNSGFSTG PHLHVDVVDS SLITRPFEIE GIFDY
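
The gene begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It builds the standard codon table (table 11 uses the same codon assignments) and assumes the NCBI table 11 set of alternative start codons. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons assumed for table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # any initiation codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate_cds("GTGAGTGATA TTTTAGGAGA CTTTAGCAAC"))  # MSDILGDFSN
```

The output matches the first 10 residues of the protein sequence shown above.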