Gene Tery_3191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3191 
Symbol 
ID4243863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4874834 
End bp4875961 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content33% 
IMG OID638108197 
Productpeptidase M50 
Protein accessionYP_722788 
Protein GI113476727 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00586872 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGGAT CTTTTCGTGT CGGCAACCTA TTTGGCATAC CATTTTACAT TAACTCATCC 
TGGTTTATAG TCTTAGGTCT CCTTACTTTA ACTTATGGTA ACGACCTAGC AACTCAATTT
TCTCAAGAAT TGGGTAATAC CTTACCTTGG ATACTAGGAT TAATAACAGC ATTATTATTA
TTTTCCTCTG TCTTAGCCCA TGAGTTAGGG CATAGTTTTG TTGCTCTATA TCAGGGAATA
AAAGTAAAAT CAATTACCCT ATTTCTCTTC GGAGGTTTAG CTAGTTTAGA TAGAGAATCT
AAGACTCCTA TAGAAGCATT TTTGGTAGCA ATTGCTGGCC CTTTAGTGAG TATATTATTA
TGCGGTTTCT TTGTATCAAT TAATATATTT ACATCCATTA CTGGACCAGC AGAATCCATT
GTTCAACTTT TAGCTTATAT CAACTTATTC CTAGCATTAT TTAACCTAAT TCCAGGTTTA
CCACTTGATG GTGGTAATAT CCTTAAATCT ATTGTTTGGA AAATCACTAA TAACCCTTAT
AAAGGAATTA TTTTTGCAAG TAGAGTAGGT CAAGTATTTG GTTGTTTAGC AATAATTTCT
GGTTTAATTC CCGCATTTTT ATTTAGTAGA ATTCCTAATT TTTGGAATAT TCTCATTGGT
TGGTTTCTAC TACAAAATGC TGGTCGCTCT GCCCAATATG GAGAAATTCA AGGTATGCTT
GCTGATTTAA ATGCAGTAGA TGCTATTATT CCTGATAATC CAATTGTATC AAACAATCTC
TCTTTACGAG AATTTGTGAA TGAATATGTT ATTGGGAAAG AAGCTAGAAA GAAGTTTTTA
GTGATAAATG AAATGGGGCA GTTTGTAGGA GTAATTAACG TTGATGATTT AAAAATAGTT
AATACATCCC AATGGCCTTT GGTTCAGGTA AAAACATTAA CAAAACCTTT AGCAAAGATA
GAGACTGTAA CCGCTAAAAC TTCTTTATTA GAAGTAATTT CTTTGTTAGA GCAAAAGCAA
ATCAGTGAAC TAACTGTTAT TGATGAAAAT GGGATCTTAG TTGGGTCTAT TGAAAAAGCT
TCAATTAGGC GTTTGTTAAC AAGAAAGGAG CAAGCTAAAA CTAATTAA
 
Protein sequence
MNGSFRVGNL FGIPFYINSS WFIVLGLLTL TYGNDLATQF SQELGNTLPW ILGLITALLL 
FSSVLAHELG HSFVALYQGI KVKSITLFLF GGLASLDRES KTPIEAFLVA IAGPLVSILL
CGFFVSINIF TSITGPAESI VQLLAYINLF LALFNLIPGL PLDGGNILKS IVWKITNNPY
KGIIFASRVG QVFGCLAIIS GLIPAFLFSR IPNFWNILIG WFLLQNAGRS AQYGEIQGML
ADLNAVDAII PDNPIVSNNL SLREFVNEYV IGKEARKKFL VINEMGQFVG VINVDDLKIV
NTSQWPLVQV KTLTKPLAKI ETVTAKTSLL EVISLLEQKQ ISELTVIDEN GILVGSIEKA
SIRRLLTRKE QAKTN