Gene Tery_3351 details


Gene Information

Locus tag: Tery_3351
Symbol:
ID: 4243445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5140081
End bp: 5142144
Gene Length: 2064 bp
Protein Length: 687 aa
Translation table: 11
GC content: 40%
IMG OID: 638108335
Product: peptidase M23B
Protein accession: YP_722926
Protein GI: 113476865
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
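
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a terminal stop codon (the gene sequence in this record ends in TAA). The function names `feature_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(5140081, 5142144)
print(gene_len)                  # 2064
print(protein_length(gene_len))  # 687
```

Both results match the Gene Length and Protein Length fields of the record.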
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0955739
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACCAG AAAATCAAAA TCAATCAATT GAGTCTAAAG CTGAAATACA GGAAATAACT 
CAAGGGCCGA ACCAAAAGCA AAATTTGAAA ATCACCAAAC TCTTCCCTCG CAAATCACTT
ATGCAGCTAG GTTGTTGTCT AGCTAGTTTA GGGATGCTGA GTAGAGGTAT GGTTTTAGGG
GAGGTGGTTT TGGAAATAGA AGTATATGAC TATGACCCAT CATATCAACC CGCCCCAGTA
CCCCAAACAT CTTATTCAGC ACCATCTTAT TCACCAGAGC CATATTATTC GCCAGCACCA
TCTTATTCAC CAGAGCCATA TTATTCGCCA GCACCATCTT ATTCACCAGA GCCATATTAT
TCACCAGCAC CATCCTATTC ACCAGCACCA TATTATTCTC AACCCTACTA CGAACCAGCA
CCATATTATT CACCAGCACC ATCTTATTCA CCAGAACCAT ATTATTCTTC ACCAGCACCA
AGTTATTACG AACCAGAACC ATATTATTCT CAACCCTACT ACGAACCAGA GCCATATTAT
TCACCGGAAC CATATTATTC TCAACCAGCA CCAACTTATG AACCGGAACC TTATTACTAT
TCCCAGCCAG CACCAATTTA TGAACCGGAA CCAGAACCGG AACCAGACAG TACTATCTAT
CTACCCAAAA ATTTATATGT AGCACCTCCG ACTAATACTA ATATCAACCT ATCAAAAAAA
TCTAAAAATA ACTTACCACC TCTAACACTA CCAAAACCTG AGAAGCTTTA TTATTATAAT
GGTTCGACTA ATGCTTATAT AGATGAGACA GATTATAAAG TGGGTGCAAC TGATAGTTAT
GAGGCACCAA GTACAGTAGT TTTCTCAGAG CGTTCTACAG GTGAAGTTGC TCAATATCCA
ACAAATAATA AAAATTGGAG TGCTGTAGCC GAAAATAATA ATAAAAATTG GAGTGCTGTA
GCTGAAAATA ATAATAATTG GAGCGCCCCT ACTCAGAACA ATAATTGGAG TGGAACTACT
GCAACAAATA ATTGGAGTGC TCCTGCTCAA ACTTCATCTT ATAATTCATT GTATGGGGAA
GGTCGGCAAA ATTACTCCTC CAACGATAAT AGTTATTATT CTTCCAATGA TAATAGCTAT
TACTCTTCCA AGGATAATAG TTATTATTCA GATACTTCTA GTTATAGTCA AACAAGTTAC
TATTCAAACT CATCCTCCGG TCAAAATGGT TATTACTCCA GCACCTCTTA TGCTCCAAAC
TACTCAGACT CTTCAAGAGG AGTGGAGGTG GCAGCAGTAT TACCGGCTCA GGTGCAGGGA
CTTTTTCATA ACCCCATATC TCAGTCAGGT TTAGCTTATT ACAAACGCAC CATGCGACCT
CCGGCAGTAC CTGGAAATGG TAATACTCGG TTAATATTTC CTCTATCTGT TCCAGCACCT
ATTACTTCAT TATTTGGGTG GAGAGAACAT CCAGTTTTAG GTTATAGAAA GTTTCATACT
GGAACAGACT TAGGGGCACC AACTGGTACT CCAGTAGTGG CAACTTATGC AGGAAAGGTA
GCGATCGCTG ACTGGTTAGG TGGTTATGGT GTAGCTGTTG TGTTAGACCA CCAGAGAAAA
TCATTGGAAA CTCTCTATGG TCACTTATCA GAAATTTTTG TGCAGCCAGG GGAGTTTGTA
CAACAGGGTG AGGTTATTGG CAGGGTTGGT AGTACGGGAA TGTCTACGGG TCCCCATTTA
CATTTTGAAA TGCGACAGTT AACACAGGAT GGATGGGTGA CGAAAGATCC AGATAGTCAT
ATTGAGTTTG CTTTGGGGAA TTTGATGGCA GCATTGAAGG TTACGGAAGT ACCGCCTGTA
CCAGAGGTTA GTATTAATGA GTTGCTGAAG CAGACTAAAG ATAATGGTTC TGGTCTGCCA
CAACTGCCTC CTTTACCCCC AGGGTTTGAA GTACCTATTC CTGATTTAGA ACCGCCAATT
TTTGAGTTGA CAAAAGCTTC GGAAGAGTCT GATGAGGATG TCAAAGTTAG TTTGAATGAG
CAGGAAAAAA AAGTGGTGGA ATAA
 
Protein sequence
MSPENQNQSI ESKAEIQEIT QGPNQKQNLK ITKLFPRKSL MQLGCCLASL GMLSRGMVLG 
EVVLEIEVYD YDPSYQPAPV PQTSYSAPSY SPEPYYSPAP SYSPEPYYSP APSYSPEPYY
SPAPSYSPAP YYSQPYYEPA PYYSPAPSYS PEPYYSSPAP SYYEPEPYYS QPYYEPEPYY
SPEPYYSQPA PTYEPEPYYY SQPAPIYEPE PEPEPDSTIY LPKNLYVAPP TNTNINLSKK
SKNNLPPLTL PKPEKLYYYN GSTNAYIDET DYKVGATDSY EAPSTVVFSE RSTGEVAQYP
TNNKNWSAVA ENNNKNWSAV AENNNNWSAP TQNNNWSGTT ATNNWSAPAQ TSSYNSLYGE
GRQNYSSNDN SYYSSNDNSY YSSKDNSYYS DTSSYSQTSY YSNSSSGQNG YYSSTSYAPN
YSDSSRGVEV AAVLPAQVQG LFHNPISQSG LAYYKRTMRP PAVPGNGNTR LIFPLSVPAP
ITSLFGWREH PVLGYRKFHT GTDLGAPTGT PVVATYAGKV AIADWLGGYG VAVVLDHQRK
SLETLYGHLS EIFVQPGEFV QQGEVIGRVG STGMSTGPHL HFEMRQLTQD GWVTKDPDSH
IEFALGNLMA ALKVTEVPPV PEVSINELLK QTKDNGSGLP QLPPLPPGFE VPIPDLEPPI
FELTKASEES DEDVKVSLNE QEKKVVE
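
The sequences in this record are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC fraction could be computed from the result. The helpers `clean_sequence` and `gc_fraction` are hypothetical names introduced for illustration, and the example uses only the first line of the gene sequence, so its GC value is not expected to equal the whole-gene GC content field.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, as displayed.
first_line = "ATGTCACCAG AAAATCAAAA TCAATCAATT GAGTCTAAAG CTGAAATACA GGAAATAACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2f}")
```

Passing the full gene sequence block to `clean_sequence` should give a string whose length can be compared against the Gene Length field.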