Gene Tery_1179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1179 
Symbol 
ID4244063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1844523 
End bp1847486 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content40% 
IMG OID638106398 
Productpeptidase M16C associated 
Protein accessionYP_721010 
Protein GI113474949 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0156529 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTAT TAACAGAAAG AACAATTGAA GTTGGACAAA AACTACAAGG CTTTGAAGTC 
AAAGCTATTA CTGACCTGAA GCAACAGCGA ATGGTGGCAT ATCAGCTAGA ACATATCAAA
ACAGGTGCAA AACTGTTACA CCTATATTCA GAAGATGCTG AAAATCTCTT TTCAATTAGT
TTTCCCACTC CCCCTCCAAA TAGCACAGGA GTATCCCATA TCCTAGAACA TTCTGTATTA
GCAGGTTCAA AAAAATATCC TGTACGCGAA CCATTTTTTG AAATGCTAAA AATGAGTCCA
GCAACCTTCA TTAATGCCAT GACTGGCCCT GACTGCACTT ACTATCCAGT TTCCAGTAAA
GTTAAACAAG ACTTATTCAA CTTGGCAGAA GTATACTTTG ACGCTGTATT TCACCCACTG
CTAACCGAAA ATACGTTTAA ACGAGAAGGA CACCACTTAG CTCCTACCGA CAAGGAAAAC
CCAACAGGGG AATTAAAATT TACTGGGGTA GTTTACAATG AAATGAATGG AGCTTTTTCT
GACCCTGAAC AAAGGTTAGA TAGTATTGCT AACCAAAGTC TATTTCCTGA TAACATTTAT
GGTCTTGAAT CTGGAGGGAA CCCCCAAAAT ATTCCTGAAC TCACTTACAA AGACTTCCGT
GACTTCCATT CCAGCTATTA TCATCCCAGT AATGCTTACT TTGTTTTTTA CGGCAATATT
TCTACTCCTG AATATCTTGA ATTTTTAGCC AAAAAGCTAG AAGCTTTTGA AAGACAGAAA
CCTAATATCA ATATTAATCC TCAATCTCGC TGGAGTGAAC CTCGTTTTAA AGAAGATTCC
TACCCTATTA GTGCAGCGGA TGAAACAACA CAAAAAACTT ACATAATGAT CAAATGGTTG
GTGGGAGATA GCACTGACTC TGAGGAATGG GTAGCTTTAG ATATTCTCAG TCGCATCTTG
TTGGGAAATG AAGCGGCACC ATTGAAAAAG GCGATTGTTG AATCCCAAAT TGGTCAAGAT
CTACTCGGCT CTGGAGTAGA TTCTGTGGGT AAGGAGGTTA CTTTTCATCT GGGTATTCAA
GGAAGCGAAC CTAACCAGGG TGAAGCATTC AGTCAGTTAG TAATCAAGAC TTTAAAAGAA
ATTGCTGAAG AGGACATTGA ACCTAGTATT GTTGAGGCAG CGTTTCAACA GGCAATTTAC
CAACACCAGG AAATAGGTAG TATGTATCCT TTACGGATGT TGTTTCGGGT GATGCAAACC
TGGATTTATA GTAATGATCC ATTGAAGTTT TTACATATTA GCGATCGCCT CGCCGAATGC
AAACAACGTT ACCTGGAAAA ACCTAGATAT TTTAATAACC TAATTCGTGA AAAATTACTC
AATAACCCCC ACCGTTTGAC ACTGGTATTA AAACCAGATA AAGAATGGCA ATCTAATTAT
GACAAAGCAG TAGTAGCACA GGTGGAACAA GTACGTTCTC AATTAACTTC AGAAGAATTA
GAACGCATAG CTACTGAAGC AACAGAATTA GAAATAGAGT CAGGAACTCC TAATTCTCCT
GAAGAAATAG CTAAACTGCC CCAACTACAA GTGAAGGACT TACCTGACAA ACCAGAACAT
ATTCCTACTG ATGTCGAAGA ACTTGATGGC CAAGTGACAC TTTTAAGGAA TCATGTGCTA
GCAAATGGAG TAAATTATTT ACAACTAGAT TTCAGTTTGC GTGGGTTGCC TGAAGATTTG
TGGTTGTATC TATCTATTTA TATAGATGCA CTGCGGAAGT TGGGTGCAGG AGAAATGAAC
TACGAACAGG TGGCTCGCGG CATTGCTTCT TATACCGGGG GAATTAGTTT TCAATCTCTG
TTACGGACTT CTACTAAAGA TGCCTATCAT TCTGTGCGTG GACTTCGCGT CACCATTAAA
ACCCTAGATG AACAGATTGA ACCAGCATTA GAGTTGTTAC ACAACATGAT TTTTGCAGTT
AATCCTCGGG ATACAGCTCG ACTACGGGAA GTGATGATTC AGTCTTATTC TCAATCTAAT
TCAGATTTGA TTTATAACGG TATCTACACT GCTATATTGC GAGCTAGTGC TGGTATGACA
TCAGAAGCTA AGATTAGTGA GATTGTTAAC GGTTTACCTC AACTGGAGTT GTTGAAAAAA
GTATGCGATC GATTTGATGA ACATGGGGCA AATTTGATGA GCAAGGTTGA AACTATCCGG
GATTATGTAG CCAATCAACC TTTGACTGCT AGTTTTACTG GTTCAGATAA TGCTTATAAT
GTGGTCAAGA AGACTCTCTC GGAGTGGGGT CATCAGCAAA AGCAACAAGA AGGAGATACT
TTTGGTAGTC GCTTTGAACC AGTTTATAAT ATGCGGGAAG CTTTAGCAGG TCCAGTACAG
GTGGCTTATT GCGTTCAGAC TATGCCAGCT CCTCACTTTA GTGATGAAAG AGCGCCATTT
TTAAGGTTAG GTACTCATTT ATTAGGTTTG GGTTATCTAT TTACAGAAGT TCGTCTTAAG
GGTAATGCTT ACGGTGCAGG ATGTCGTTAT AGTGGTTTAG GAAAAGTTAT TTCTCTCTAT
TCTTATCGCG ATCCTCATGT CAGTCGTACT CTTGATGTAT TTGCTGGTTT GATAGATTAT
CTTAAGGATG TAGATTGGAC TCAGATTGAT GTTGACCGGG CAATTATTGC TACAATTCAA
GATGATTCTC CAGTTTTGCG CCCAGAAGTA GCCACAAGCT TAGCTTTGGA ACGTCATTTG
ATAGCGCAAA CTGCTGAACT TAGGGAGGAA CGTTATCAGC GAACGCTCAA AGCTACAGTT
GCAGATGTGA AAGAAACTTT GTTAGATGTT TTTACTGCGG GGATGGAACG TAGTAATGTT
TGCGTAATGT CTTCTCGCGA AAAATTGGAA GAAGCAAACC GTTCTCGGGA GGCGGATCCA
TTGACAATTT CTGATATTAT GTAA
 
Protein sequence
MPLLTERTIE VGQKLQGFEV KAITDLKQQR MVAYQLEHIK TGAKLLHLYS EDAENLFSIS 
FPTPPPNSTG VSHILEHSVL AGSKKYPVRE PFFEMLKMSP ATFINAMTGP DCTYYPVSSK
VKQDLFNLAE VYFDAVFHPL LTENTFKREG HHLAPTDKEN PTGELKFTGV VYNEMNGAFS
DPEQRLDSIA NQSLFPDNIY GLESGGNPQN IPELTYKDFR DFHSSYYHPS NAYFVFYGNI
STPEYLEFLA KKLEAFERQK PNININPQSR WSEPRFKEDS YPISAADETT QKTYIMIKWL
VGDSTDSEEW VALDILSRIL LGNEAAPLKK AIVESQIGQD LLGSGVDSVG KEVTFHLGIQ
GSEPNQGEAF SQLVIKTLKE IAEEDIEPSI VEAAFQQAIY QHQEIGSMYP LRMLFRVMQT
WIYSNDPLKF LHISDRLAEC KQRYLEKPRY FNNLIREKLL NNPHRLTLVL KPDKEWQSNY
DKAVVAQVEQ VRSQLTSEEL ERIATEATEL EIESGTPNSP EEIAKLPQLQ VKDLPDKPEH
IPTDVEELDG QVTLLRNHVL ANGVNYLQLD FSLRGLPEDL WLYLSIYIDA LRKLGAGEMN
YEQVARGIAS YTGGISFQSL LRTSTKDAYH SVRGLRVTIK TLDEQIEPAL ELLHNMIFAV
NPRDTARLRE VMIQSYSQSN SDLIYNGIYT AILRASAGMT SEAKISEIVN GLPQLELLKK
VCDRFDEHGA NLMSKVETIR DYVANQPLTA SFTGSDNAYN VVKKTLSEWG HQQKQQEGDT
FGSRFEPVYN MREALAGPVQ VAYCVQTMPA PHFSDERAPF LRLGTHLLGL GYLFTEVRLK
GNAYGAGCRY SGLGKVISLY SYRDPHVSRT LDVFAGLIDY LKDVDWTQID VDRAIIATIQ
DDSPVLRPEV ATSLALERHL IAQTAELREE RYQRTLKATV ADVKETLLDV FTAGMERSNV
CVMSSREKLE EANRSREADP LTISDIM