Gene Tery_4034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4034 
Symbol 
ID4242062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6237948 
End bp6239858 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content37% 
IMG OID638108943 
Productcell wall hydrolase/autolysin 
Protein accessionYP_723524 
Protein GI113477463 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.172478 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAATT TTAATCCACT GCTAATGGTT ATAAGAAAAG CAATAAATTT CCAAGTATTT 
AGGAAAATTA TTAGTTTATC TACCCTAGAG TTAATAATAT TTTTAGGACT AATAAATATC
ATTTCTCCTA ATGCTATGTC ACAAAATTAT GAAGATAATA ATCAACTTCC ACAGGACTCC
AAGTCAAGGA AAAAATCTCT ATTTATTGCT TATCCTTCAC CCAATCATAA AACTACTTCT
GACAGAATTT TCTTAATTGG TACAGCTTCT CCAGAAGCTG AAGTTACTGT TAATGGAAAA
CTAATTAATG AGCGTAGTCG AATGGGTCAT TTTGCTCCTA CTTTTCCATT ACAAATAGGG
GAAAATATAT TTACTGTACA ACATCAAAAT GAAGAGATTA TTTTAAAAAT AACACGCAAT
TCAAATCAAC CACCTTTACC AGTAGGGGTA GCTTTTACTC AAGATTCTTT GACTCCGATA
AAAGATATTG CTAGAATGCC CGGGGAGTTG ATTTGTTTTC AGGCGATCGC TCCTCCAAAC
GCTAATATTT CTGTATTATT AGCTACCCAA ACTATTCCCC TATTTGCCCA ATCTCAAATA
GTAAATTTAC CAGGAAATTT AGCAGTATTA ACAGGAGATA ATCAACCTTT TCCCTCTGGG
GGAGACTATT ATCAAGGTTG TACAAAAGTA GAAACACCAG GAGAGTTAGG TCAACCTGAG
TTTCAACTTA GTCTCATGGG GAAAACAGTG ACTCAAAAAG CACCGGGAAA AGTGAGTATT
CTTTCTCCAA CTACGTTTGA AATAGCTGAG GTGACAGTAG AAGAGGGAGT TGCCAGAACC
GGACCCAGTA CCACCTATTC ACGCTTGACA CCATTACCAA AAGGTACTAA AGCTTTAATT
ACAGGGAAAG AGGGAGATTA TTTACGCCTT GACTATGGTG GGTGGATCAA AGCAAACGAG
ACCAGGATAT TTACAGATGC AACACCACCG CGTTCGGTGA TTCGGAGTGC GATCGCTCGT
CAGGTTCAAG GTGCAACGGA AATTAGGTTT CCTCTGCAAG TTCCGGTGCC AGTAACAGTT
GAACAGGGCG CTCGCTATTT AAGTTTAACT TTGCATAATA CTACTGCTCA AACTGATACT
ATTCGCCTAG ATGATGATCC CTTAATTGAG AGATTAGACT GGCAACCAGT TTTGACTTCA
ACAGTACAAA ATGAACAGGC AGTTAGATAT AAGTTTAACT TAAAAACAGA CCAACAATGG
GGTTATAAAT TACAATATGT TGGTACAACT TTACTATTAA CTTTGCGACA TCCTCCAGCG
GTAAAAAGTG TTATTAGTTC TGCTACTCAA CCTTTAACTG GAATGAAAAT TTTAATTGAT
GCAGGACATG GTAGTGAGAA TGATCTAGGG GCAATAGGTC CGACTGGTTA TCCTGAAAAA
AATGTGACAT TAATTATATC AAAGCTATTA CAAAATGAGT TAATTAATCG AGGAGCTTTA
GTGTATATGA CACGAAAGGC AGAGGAAGAT TTATACCCTA AAGACCGTGT AGAAATGATT
AATCAACAAG TTCCAGATTT AGCACTTTCT GTTCATTATA ATGCTTTACC TGATTATGGA
GATGCCCTAA AAACTCAGGG AATTGGTACT TTTTGGTATC ATTCTCAAGC TCATAGTTTG
GCAATATTTT TACATAATTA TTTAGTGGAA AAATTAGACC GACCTTCCTA TGGAGTATTT
TGGAATAATT TGGCTTTAAC TCGTCCCGCG ATCGCTCCTT CTGTGTTGTT AGAATTAGGA
TTTATGATTA ACCCTTATGA ATTTGAATGG ATTATGAATT CTCAAGAACA ACAGAAATTA
GCAAAAGCAT TAGCAGATGG AATTGTGGAG TGGGTCAAAA AAAGTCAGTA A
 
Protein sequence
MGNFNPLLMV IRKAINFQVF RKIISLSTLE LIIFLGLINI ISPNAMSQNY EDNNQLPQDS 
KSRKKSLFIA YPSPNHKTTS DRIFLIGTAS PEAEVTVNGK LINERSRMGH FAPTFPLQIG
ENIFTVQHQN EEIILKITRN SNQPPLPVGV AFTQDSLTPI KDIARMPGEL ICFQAIAPPN
ANISVLLATQ TIPLFAQSQI VNLPGNLAVL TGDNQPFPSG GDYYQGCTKV ETPGELGQPE
FQLSLMGKTV TQKAPGKVSI LSPTTFEIAE VTVEEGVART GPSTTYSRLT PLPKGTKALI
TGKEGDYLRL DYGGWIKANE TRIFTDATPP RSVIRSAIAR QVQGATEIRF PLQVPVPVTV
EQGARYLSLT LHNTTAQTDT IRLDDDPLIE RLDWQPVLTS TVQNEQAVRY KFNLKTDQQW
GYKLQYVGTT LLLTLRHPPA VKSVISSATQ PLTGMKILID AGHGSENDLG AIGPTGYPEK
NVTLIISKLL QNELINRGAL VYMTRKAEED LYPKDRVEMI NQQVPDLALS VHYNALPDYG
DALKTQGIGT FWYHSQAHSL AIFLHNYLVE KLDRPSYGVF WNNLALTRPA IAPSVLLELG
FMINPYEFEW IMNSQEQQKL AKALADGIVE WVKKSQ