Gene Tery_4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4237 
Symbol 
ID4245889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6542893 
End bp6546090 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content35% 
IMG OID638109132 
Producthelicase-like 
Protein accessionYP_723710 
Protein GI113477649 
COG category[L] Replication, recombination and repair 
COG ID[COG1111] ERCC4-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00803691 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAACT TCCGTTCCCA TTATTGGTCT ATTAGCTACT CCAGCAATGA AAATAATCCT 
ATTGCTGACT TCTATATCCC AGCCTTAGAA TGTGCTATTC AATATGATAG AAAATCTGGT
TTTTTTAGTA GTGCTATTCT CAGTAAAGTA ACCAGAGGAT TAGGAGCAAT GCTTCAAAAT
GAAGGTAAAA TACGTCTGAT TATGGGATGT CAATTTAACC CTCAAGATTT AGAAACCATT
CAAAAAGGTT ATGAGTTGAG AAAAGCTCTA ACCCTTCGTC TTGATGCTGA CTTAAAACCA
CCAGAAAATT TTGCCCAGCT CAAACATTTT GAAATTCTTA GTTGGTTAAT TTCTACAGGT
TATTTAGATA TTAAAATTGC CATTCCTCTT AAAAATAACG GACTGCCAGA AGCAAGCGAT
CAACAATTAG ACCCCCAACA TATTTTCCAT GAAAAAGTTG GAATTTTTAC GGATAAAAAT
GGCGACCAAA TAGCCTTTAG TGGTTCTAAT AATGAATCAT TAGGAGGATG GGAACAAAAC
GTAGAATCGT TTCATGTTTA TTGTGCTTGG GAAGGAGAAA GAGACTTTGA TAGAGTACAA
GAAGAAGTAT ATAGATTTGA GAAACTTTGG CATAATTTAG CTCCTAATGT GAAAGTATTT
AATATTCCAG AAGCCGTTGC AGAAAAACTT TTACGTTATA CTCCCTCTAC TAAACCCACT
TGGAATGAAA AAGTTGAATT TGACACAAGA CCATTACCAT CAAAACTATC ATATTTTTCT
GACAACAAAT CCGAGGAAAA AGAGGAAAGT AAAAACATAT TTTCTACAGA TAATATTGTC
GAGAAAACCA TCACAAATAT CTCAGAAGCA GACAAAGAAA AAGAACTTCA AGCTTTTAAT
ATTATCATAA ATTCTCATCA ACATCCTGGT TGCCTAGACT ACTGCTTACA ATCCATCACA
ATCGACCCTT GGCCTCATCA AATAAAAATT CTTAAACAAG TTGCTAAAAA ATTCCCCTGT
AACTTTCTCA TAGCAGATGA AGTTGGATTA GGAAAAACAA TTGAAACAGG TTTAATCTTA
CGTTATTTAC TCCTCACAAA AGCCATCCAA CGAGTTTTAG TATTAGCACC AGCTAGCGTT
CAACCACAAT GGCAAGAAGA ACTCAGAGAA AAATTTAATC TGCACTTTTG GAGTTACAAC
AAAGGAGAAT TTAAAGACCC CTACGATAAC ACTTCAACTA TTAACAAAAC TAACCCTTGG
AACAGTCATA AACTAATATT AGCATCCTCC CATTTAGTCA GACGTGCTGA GAGAATGACA
GAAATATTAG CAGCAGAAAA ATGGGATTTA ATTATTTTAG ATGAAGCTCA TCATGCTCGT
CGCCAAGCCC CCCAAAATAG GAAAGAAACC CCTAACTCTC TCTTGAAATT AATGCAGCAG
TTACGAGAAA AAACTAAATC TTTAATCCTA TTATCTGCCA CTCCTATGCA AATAGATCCT
ATCGAAGTAT TTGACTTATT AGAACTTTTA GGATTAAAAG GACATTGGAG TTATGGGGAT
AATTTTTGTA ACTATTTTGC CTCTTTATCA GGTAAACCAG ACGAGCATAC TTTCAATTTT
TGGCAGGTAA TGACTACAGA TTATTTTAGA CTAGGAGGAG TACCTTCCTC TCAGTTTGCA
TCCTATCTAA ATAAAAGCGA TCGCTTAATG TCCTACCGCC TCCGAGATAT TTGGCAACAA
GGGAAAAAAA TAGTTAATCA TAAAAAACTA GCACAGGATA AACCCTTTGT GAATACTTCC
CGCCAATACC TAACTATTAA TACTCCCCTC AAAGACTTGA TGTTTCGTCA CACCCGGGAC
ACCTTACGTC AATATTATCG CCGTGGAATT CTCAATAGAG ATATTCCCAA ACGAGTTGTA
CAGGATAAAG CCATTATATT AGAACCCAAC AGAGAAGTAC AATTATATGT AGAAGTTAGT
GACTATGTAC GTCATTTCTA TAAATTAGCT CAAAAAGAAA ATCGTCAGGC GCTAGGTTTT
CTAATGACTC TTTACCGCAA ACGTCTCACC AGCTCGTTTT TTGCAATTCG GGAGTCATTG
CAACGTCGTT TGGATGGTAT TAGTATTACC GCTGACGACT TGGGAGACTT AGATGATGCT
GACGATGCTA TTATTACAGG ATTAGAAAGT TATTGCCAAA CTGAAACTGT AGACCCCCAG
GAAATTGAAT ATTTAGAAAG TCTATTGTTG CAGTTTGAAA ATACTGGTGA AGATACTAAA
CTATCTCACT TCATCACAAC TTTAAGAACA GAATTAATAG ATAGAGACAG TGTAATTATA
TTTACCCAAT ATACCGACAC AATGGATTAT TTACGACGGA CTTTAAAAGA TTTATATGGT
AGTCAAATTG CCTGTTATTC TGGACGCGGT GGAGAAGTTT ATCAAGATCA AAAATGGTGT
CTAGTTCCTA AAGAGAAAAT CAAACAAAAA TTTCGAGCTG GAAGTATAAA AATATTATTA
TGTACTGAGT CAGCATCCGA AGGTTTAAAC TTACAAACCT GTGGAGTAAT TATTAATTAT
GATATGCCTT GGAACCCAAT GCGAGTTGAA CAGAGAATTG GTAGATTAGA TAGAATAGGT
CAATTTTATC CCACTGTGAG AATTTATAAC TTTTATTATG ATGGAACAGT AGAAGCTAAA
GTTTATAAAA AACTGCGCGA TCGTATTGAT ACTTTTCAAA ATATAGTAGG CAACCTTCAA
CCTATTTTAG CCAAAGTTCC TACTTTTATA GAGAAAGCAG TTATGAGTGC CGACCCTGAA
GAAGAAAATG TATTAATGTC TGATTTTGAT AAAAATGTGT TAAATACTCC TCCATTAAGA
CCTGCTTTAG ATGAGATGGT AGCCATGGAT GTAGAGACAG ACTTGACGGA AATTAGACAA
CCGCTAATTC CTACAAATTT ATCTGCTGAA CTAATAGAAA ACTTATTTAC TGATTCGTTA
CTATTAAAAT TATCAGGGAT AAAATTTACA TTTTTAGAGG ATAAATTGTG GCAGTTAAAC
TATCAAAATA GTAATTATCA AGTAACATTT GATGTGAATG TTTTTGAGGG TAAACCATCT
GTAAGATTTA TGAGCTTTGG AGAACCTTTA TTTGAAGAAT TGTTAGGGTT AATAATGAAT
AATAGCAGCA ACCTTTAA
 
Protein sequence
MPNFRSHYWS ISYSSNENNP IADFYIPALE CAIQYDRKSG FFSSAILSKV TRGLGAMLQN 
EGKIRLIMGC QFNPQDLETI QKGYELRKAL TLRLDADLKP PENFAQLKHF EILSWLISTG
YLDIKIAIPL KNNGLPEASD QQLDPQHIFH EKVGIFTDKN GDQIAFSGSN NESLGGWEQN
VESFHVYCAW EGERDFDRVQ EEVYRFEKLW HNLAPNVKVF NIPEAVAEKL LRYTPSTKPT
WNEKVEFDTR PLPSKLSYFS DNKSEEKEES KNIFSTDNIV EKTITNISEA DKEKELQAFN
IIINSHQHPG CLDYCLQSIT IDPWPHQIKI LKQVAKKFPC NFLIADEVGL GKTIETGLIL
RYLLLTKAIQ RVLVLAPASV QPQWQEELRE KFNLHFWSYN KGEFKDPYDN TSTINKTNPW
NSHKLILASS HLVRRAERMT EILAAEKWDL IILDEAHHAR RQAPQNRKET PNSLLKLMQQ
LREKTKSLIL LSATPMQIDP IEVFDLLELL GLKGHWSYGD NFCNYFASLS GKPDEHTFNF
WQVMTTDYFR LGGVPSSQFA SYLNKSDRLM SYRLRDIWQQ GKKIVNHKKL AQDKPFVNTS
RQYLTINTPL KDLMFRHTRD TLRQYYRRGI LNRDIPKRVV QDKAIILEPN REVQLYVEVS
DYVRHFYKLA QKENRQALGF LMTLYRKRLT SSFFAIRESL QRRLDGISIT ADDLGDLDDA
DDAIITGLES YCQTETVDPQ EIEYLESLLL QFENTGEDTK LSHFITTLRT ELIDRDSVII
FTQYTDTMDY LRRTLKDLYG SQIACYSGRG GEVYQDQKWC LVPKEKIKQK FRAGSIKILL
CTESASEGLN LQTCGVIINY DMPWNPMRVE QRIGRLDRIG QFYPTVRIYN FYYDGTVEAK
VYKKLRDRID TFQNIVGNLQ PILAKVPTFI EKAVMSADPE EENVLMSDFD KNVLNTPPLR
PALDEMVAMD VETDLTEIRQ PLIPTNLSAE LIENLFTDSL LLKLSGIKFT FLEDKLWQLN
YQNSNYQVTF DVNVFEGKPS VRFMSFGEPL FEELLGLIMN NSSNL