Gene Tery_1038 details

Gene Information

Locus tag: Tery_1038
Symbol:
ID: 4242001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1625440
End bp: 1627242
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 34%
IMG OID: 638106271
Product: peptidase M61
Protein accession: YP_720883
Protein GI: 113474822
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
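
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1625440, 1627242)
print(length)                  # 1803
print(protein_length(length))  # 600
```

Both results match the Gene Length and Protein Length fields of this record.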


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.316472
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAG CTAAAATATT AACCATCAGT CCAACAATTA CAAGCCCAGC AATTCAATAT 
AAAGTATCTA TGCCTCATCC AGAATCTCAT CTGTTTGAGG TTAGTTTGTC TGTAAGAGTT
GAAGAATTAT CTTCCTCATT ATTACAAATG TCCAAAAAAC TGGATTTAAA AATGCCAGTA
TGGACACCAG GTTCTTACTT AATCAGGGAA TATGCTAAAC ATTTGCAAGA TTTCTGTGCC
TATAGTGAAA ATAAACAACC TTTACCTTGG CAAAAACTTA GCAAAAATCA CTGGCAAATA
GAAACATTGG GAGTCTCAAA AGTAATAGTT CAGTACAAGA TATTTGCTAA TGAATTAACA
GTGCGCACCA ATCATTTAGA CTCTACACAT GCTTATTTTA ACGGTGCAGC TTTGTTCTTT
TATATTCCTG AATGTGAAAA AAATAAGATT AGGCTCGAAG TTATTTCACC ATTACCTAAT
TGGCAAATTA CGACATCTTT ACCAAAGACT CCAAATACAG AAAATACATT TGAAGCAGAA
GATTTTGATA CTTTAGTAGA TAGTCCTTTT GAGATAGGTA ACCATCAATT ATATCAATTT
GAAGTAGAAG GAAAAAAACA TCAATTAGCT ATTTGGGGAA AAGGTAATGC AGAGCCAGAA
AAATTAATTC CAGATATACA AAAAATTATT GCAGTAGAAG CAGAGTTTTT TGGTGGTTTG
CCTTATGAAG AATATTTATT TATTTTGCAT AGTTCTAGTA AAGGATTTGG TGGTTTAGAA
CATAAGTTTA GTTGTACCTT AAATTATCCG AGATTTGGTT TTAGGAATAA GGAAAAACGT
GATCGGTTCA TGCAGCTAGT TGCCCATGAA TTTTTCCACT TGTGGAATGT TAAACGTATC
CGACCTAAAG CATTAGAAGA GTTTGATTAT GACCAAGAAA ATTATACTCC TTCTCTGTGG
TTTTCTGAAG GTACAACTAG TTATTATGAC TTATTAATTC CTCTAAGAGC AGGTATTTAT
GATGTTCAAA CTTTCTTGAA AGAATTAGGA AAAGAAATTA CACTTCTGCT AACAACAATA
GGAAGAAAAG TACAACCTGT AAGTGAGTCT AGTTGGGATG CTTGGATTAA ATTATATCGT
CGGGATAATA ATAGTAACAA CTGTCAAATT TCCTATTATT TAAAGGGAGC AATGATATCT
TTATTACTTG ATTTGTTAAT TCGAGAAAAA TATGAAAATC AACGCTCACT AGATGATGTA
ATGTATCAAA TGTGGGAGAA ATTTGGTAAG TCAGAAATAG GTTTTACTCC AGAACAATTG
AAAGCTGTAA TTGAAGAGGT AGCAGAATTA GATTTGGGCA ACTTCTTTAA GAGATATATT
GATGGTTTAG ATGAGTTACC TTTTGATGAA TACTTCGGGC ATTTTGGCCT GCAACTTAAA
AAAGAAGATA ATGAATGGCC TGATTGGGGT ATGAATGTTG TTAGTGAAAA TAATAAAGAA
ATAATTAAGT TTGTAGAAAA TAACGGGCCA GCACAGTTGG CGGGAATAAA TGCAGGAGAT
CAGTTACTGG CAATAAATGG TTTTCGGGTA AATGCAGATA AGTTGGGCTA TCGCCTCAAA
GATTATCAAC CAGGAGATAT TTTGGAAGTA ACTGTTTTCC ATCAAGATGA GCTTATTACT
CATCAGATAA CTTTGGCTCA CCCCGGTCCT AGTCGTTACC AATTGGTTCC AGTGAAAAAT
CCTACGGCAA CACAAGAAAA AAATTTTGTT GGGTGGTTGG GAAGTTCATT AGAGTCTATT
TGA
 
Protein sequence
MTEAKILTIS PTITSPAIQY KVSMPHPESH LFEVSLSVRV EELSSSLLQM SKKLDLKMPV 
WTPGSYLIRE YAKHLQDFCA YSENKQPLPW QKLSKNHWQI ETLGVSKVIV QYKIFANELT
VRTNHLDSTH AYFNGAALFF YIPECEKNKI RLEVISPLPN WQITTSLPKT PNTENTFEAE
DFDTLVDSPF EIGNHQLYQF EVEGKKHQLA IWGKGNAEPE KLIPDIQKII AVEAEFFGGL
PYEEYLFILH SSSKGFGGLE HKFSCTLNYP RFGFRNKEKR DRFMQLVAHE FFHLWNVKRI
RPKALEEFDY DQENYTPSLW FSEGTTSYYD LLIPLRAGIY DVQTFLKELG KEITLLLTTI
GRKVQPVSES SWDAWIKLYR RDNNSNNCQI SYYLKGAMIS LLLDLLIREK YENQRSLDDV
MYQMWEKFGK SEIGFTPEQL KAVIEEVAEL DLGNFFKRYI DGLDELPFDE YFGHFGLQLK
KEDNEWPDWG MNVVSENNKE IIKFVENNGP AQLAGINAGD QLLAINGFRV NADKLGYRLK
DYQPGDILEV TVFHQDELIT HQITLAHPGP SRYQLVPVKN PTATQEKNFV GWLGSSLESI
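
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes translation table 11 as listed in the record; for sense and stop codons that table uses the same assignments as the standard code, written here in the usual TCAG codon order. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the sketch ignores the alternative start codons that table 11 permits, since this gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG order (translation table 11 assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence shown above.
first_bases = "ATGACTGAAG CTAAAATATT AACCATCAGT"
print(translate(first_bases))  # MTEAKILTIS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the reported GC content of 34%; that full-length check is not carried out here.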