Gene Tery_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1543 
Symbol 
ID4242022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2343203 
End bp2347798 
Gene Length4596 bp 
Protein Length1531 aa 
Translation table11 
GC content35% 
IMG OID638106686 
Productprotein splicing site 
Protein accessionYP_721296 
Protein GI113475235 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.706415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTT TACATAGTTT TTGGGTGACA GAACATGAAC AACCTAGTTT TTTATTTATT 
TGGGGAGAAG CTTGGCACAG AGTTACAGAA GAGGATGCTG GTGAAGTAGA AAAAATTACT
AATAATCCTT ATTCAATAAC TTTAGATGAA CTATTAAAGT TATCTGAGGC AAATATATAT
TTATCTTTAG AAAAGCACAA AAAATCCACA AATCAAACTT TAGGTATCCC CACAAAGTTA
TCAGAATCTA ACCAAAAATT ATACCCAATT CACTCTGCTA TAAGTTTATC AGAAATTCCA
GAAAATTTAT ATATTTATCC CTGGAAAATA GAAGGAATTT GTCTAGAACC TGATGAAGCA
ATTAAATTTT TACAATCTAT TCCTCTAGGA CAAACTACTG AATCTTTCAT CGGGTCAGAT
TTAAAGTTTT GGTCTCATCT TGCTCGATGG AGTCTTGACT TATTAGCAAG ATGTAAGTTT
TTACCTTCTA TTGAACAAAG ATCTATTACA CAAGAATTTA TTACAACTTG GCAACCTTTA
ATTGATAGTT CTATAGACCA AACAAGATTG AAAAATTTTG CCCAACAAAT GCCTCTTGTT
TGTCGTACAT ATAATCTTGA TTGGCAATTA CATCCTTCTT TAGAAAGTTT ACAAATAATT
ACTAAAAATA ATAGTTACTC AAATTTACCT CGACTTCAAT CAAGTCAAGA AATTATACAG
GATTTCTTGA AAAATACAAT AGATAAACAG ATTAGACAAC TATCAGCTGA AATATCTCTA
ACAGAAACTA CCTCATTAAA TTCTTCAATT AGACAATGGT TGAAGTCTTT ATCAGGAAAA
TTAAGTTTAA AATTACCTGC TCCTCAAGAA ACAAAAAAAA TTCAAAAAAT ACTTGATAAT
TGGAAGTCAC CTCTACAAGA ATATCAAGCT ATAGAAAATA AGTTTGTTGC CTGTTTTTGT
CTCCACTCAC CATCAAATAA TTCCCAACAA TGGAAGCTAG AATATTGTCT CCAAGGACTT
GATAACCCAG ATTTTTTGGT TGATGCTAAA ACTATTTGGG AAAATCCTGT GGCAAGTTTA
AATTATCAAG GTAAAACAAT TAAACTGCCT CAAGAAACTT TGCTTAAAGG TTTGGGTTTA
GCATCAAGAA TTTATCCAAT TATTGAGCCT AGTTTACAAG AAGCAACTCC CCAATATTGC
TTACTAACTT CACAACAAGC TTATGATTTT ATTAAAAGTG GAAGTGGGCG GTTTATTGAT
AGTGGACTGG GGGTAATTTT GCCTCCTAGT TTAGCTAACC GAGAAGGGTG GGCTTCTCGT
TTGGGCTTAA GTATTCAGGC TACAGCTCCA AAAATGAAAA AAACTGAAAA ACTTGGGTTG
AAAAGTCTTC TCAATTTTAA GTGGCAGTTG TCTATTGGAG GTCATAAATT AACTAAAGCT
GAATTTGAAA AATTAGTTTC TCAAGATAGT CCTTTAGTAG AAGTTAATGG TGAGTGGGTG
GAGTTACAAG GTCAAGATGT CCGAGCTGCT AAAAATTTCT TTGCCTCTCG TAAAGACCAA
ATGAGTTTAT CTTTGGAAGA TGCTTTGCGC CTGGCGACTG GGGATACTCA GACTGTGGAA
AAGTTGCCTG TTGTTAATTT TGAAGCTGGG GGTCAATTTC AGGAACTTTT AGATACTTTA
ACTAATAATC GTTCTTTAGA GGAAGTTTCT ACTCCTGAGA ATTTTCGGGG AGAACTGCGC
AATTATCAAG CACGAGGGGT AAGTTGGTTG AGTTTTCTGG AACGTTGGGG TTTGGGTGCT
TGTTTGGCGG ACGATATGGG TTTGGGTAAG TGTGTGTTGC ACGATACAGA AATTTATGTT
AATGGTATGG TAATGGAAGC AGAACAAATT TGGCAAGCTT ATGCAGGTGA GGCTGAATTT
GATGGGGAAG GGTTTTGGAC AGAGCCTAAT AAGGAGTTAT TGGTAAATTC TTTAGATGAG
ACAACTGGTA AAATTGTCTT TGCTCGGATC CGACGGTTAT ATCGCCAGTG GGTACGGGAA
AAGTTACGGA AGGTTAGGTT GAAGGATGGT AGTAGTATTA CTATTACTTG TAGGCATAAA
TTATTTATTC GCGATAGTTG GAAAAATGAT TTTCAGGTGG GAGATGATGT TTGTGTACCT
GCAAAGCTGA TGTGGGATGG GAAACCAGAA GATCCAGATG TGGTTAAGTT TGTTGCTTGG
CAAGTAGCTG AAGGATGGGA AAGGGTTAAT TCTGGAATGT TTGGTGTTTC TCAGAAGGGA
AAGGATGTTT TGGAAGGTTT ACTTGAAGTT TTTAGCAGAC TTGGAAAACG ATATGATATT
AAAATTAATT GCCCTAAAGT TGTTGCTCAT GGTAGCAAGA AAAATTGTTA TGAATTTAGT
GCGCATAGTT TAGAGTATCG GAAATTTTTG GAAGAAAAAA GATATGGCTG GGGGAAGCGA
TCGCATGAAA AAACTATCCC ACTATTTATA ATGCAGGCTG ACTTAGACAG CGTTAGAGTT
TTTCTGAGCA ACTACTTTGA TGCAGAGGGT TGGGTAAACA AAACTGTAAG ATGTGTCGAA
ATTTCTACAG CATCATCCCA ACTTATTCAA GAATTATCTA TCCTTCTCAG ACGCTTTGGG
GTTTGGATGA AAATTTCACC TCAACAGAAA TGCGCTACTA ATGGTACTGG AGTTTTTTGT
ACCTATTATA TTGGTACATT TGGTGGTAAT TCGGCACGCT GTTTTTTACA AGAAATAGGA
TTTAATGATT CAGGAAAACA AGAAAATCTT AAGTCAATTT GTGAAAAAAT TGCTGACTCT
AATGTTGAAG GAATTCCTGC TTCTGATATA GTTGCTGAGT TAGTAGAAAA AACTCAACTA
CCAGTGGGAA GTTTAGGAAT TCAGGATCCA ATTTATATGG ATGGTTGCCA AGATTTTTCT
CCTACTAGTT TAGAAAAAGT TATTAACAGT ATTGAGGATA TTATTAGTGG AGCTGGGGAA
GAGGAGTATG GTCAACTGAA ATCTTCAAAA TTGAGGAATA AAACTTTAGA GGCTTATTCT
TTACTGAATA TTTTAGAGCT AGAAATTTAT AAAACTAGGC TGCAAAAATT ACTCAACCAA
GAGGTCTATT ATTGTCAGAT AGAATCTATC GAAGAGATGG AATATGAGGG ATGGGTTTAT
GATTTTGAAG TGAGCAAATA TCATAATTTT GTTGCTAATA ATATTATTTG TCATAATACT
ATTCAAACTA TAGCTTTTCT ACTGAAACAA CAAGAGCAAA AAGCACTTAA AGGACCTACG
TTATTAGTTT GTCCTACTTC TGTATTAGGA AATTGGGAAA GAGAGGTCAA AAAGTTCGGA
CCGACTTTAA AAGCAATAGT ACATCATGGG GATAAACGTG CTAAAGGTAA AGGGTTTGCT
ACAGCAGTAA AAGATACAAA TTTAGTCATA ACAAGTTATG CTTTATTGCA TCGAGATGAG
AAAATATTGG AGACAATAAA ATGGCAGTCA GTGATAGTTG ATGAAGCCCA AAATATTAAA
AATCCAGAGG CAAAACAATC TCAAGCAGCT AGAAAGTTAG ATGCATCATT TAGAATTGCT
TTAACAGGAA CTCCTGTAGA AAATCGCCTT TCAGAATTAT GGTCAATTTT GGATTTTTTG
AATCCTGGAT ATTTAGGTCA AAAGCAATTT TTCCAGCGTC GGTTTGCTAT TCCTATTGAG
AAATATGGCG ATACAAGTTC GTTGCAAATT TTGCGTTCTC TTGTTCAACC ATTTATTCTC
CGTCGCCTTA AAACTGATAA AGATATTATT CAAGATTTAC CAGAAAAACA GGAAAATACT
ATTTTTTGCC CACTGGCAAA TGAACAAGCA TTGCTTTATC AAAATATAGT TGAAAATTCT
TTGGCAGAAA TTGATACTGT TGGTGGTATT CAGCGAAAAG GAAAAATTTT AGCTTTGTTA
ATAAAACTGA AACAGTTGTG TAATCATCCT GTACTTTTAC AAATCAAAAA AGGTAGTAGG
AAAAAGGTAG AAATTACTGA TAAAAATTCA GGGAAATTAC AACGTTTAGG GGCAATGTTG
GAAGAAATAA TTTCTGAGGA AGAACGGGCA ATTATTTTTA CACAATTTGC TGAATGGGGA
AAGGTTTTAC AACCTTATTT GCAGAAAAGT TTAGGTAGGG AAGTTTCTTT TTTATATGGT
TCTACTCAGA GGAGTAAAAG GGAGGAAATG ATTGATCAAT TTCAACTGGA TCCTCAAGGT
CCTCCTGTCA TGATTTTGTC GTTAAAAGCT GGAGGTACTG GTTTAAATTT GACTCGGGCT
AATCATGTTT TTCATTTTGA TAGATGGTGG AATCCTGCAG TGGAAAATCA GGCTACGGAT
CGGGTATTTA GAATTGGTCA AACTCGAAAT GTTCAGGTGC ATAAGTTTGT TTGTACTGGA
ACTTTGGAGG AGAAAATACA TGATTTAATT GAAAGTAAAA AGGAGTTGGC TGAGCAGGTT
GTAGGTGCGG GTGAAAAATG GTTGACTGAA CTTGATACGG ATCAGTTGAG AAATTTATTA
ATACTTGACA GAAATCAAGT GATTAAAGAG GAATAA
 
Protein sequence
MAILHSFWVT EHEQPSFLFI WGEAWHRVTE EDAGEVEKIT NNPYSITLDE LLKLSEANIY 
LSLEKHKKST NQTLGIPTKL SESNQKLYPI HSAISLSEIP ENLYIYPWKI EGICLEPDEA
IKFLQSIPLG QTTESFIGSD LKFWSHLARW SLDLLARCKF LPSIEQRSIT QEFITTWQPL
IDSSIDQTRL KNFAQQMPLV CRTYNLDWQL HPSLESLQII TKNNSYSNLP RLQSSQEIIQ
DFLKNTIDKQ IRQLSAEISL TETTSLNSSI RQWLKSLSGK LSLKLPAPQE TKKIQKILDN
WKSPLQEYQA IENKFVACFC LHSPSNNSQQ WKLEYCLQGL DNPDFLVDAK TIWENPVASL
NYQGKTIKLP QETLLKGLGL ASRIYPIIEP SLQEATPQYC LLTSQQAYDF IKSGSGRFID
SGLGVILPPS LANREGWASR LGLSIQATAP KMKKTEKLGL KSLLNFKWQL SIGGHKLTKA
EFEKLVSQDS PLVEVNGEWV ELQGQDVRAA KNFFASRKDQ MSLSLEDALR LATGDTQTVE
KLPVVNFEAG GQFQELLDTL TNNRSLEEVS TPENFRGELR NYQARGVSWL SFLERWGLGA
CLADDMGLGK CVLHDTEIYV NGMVMEAEQI WQAYAGEAEF DGEGFWTEPN KELLVNSLDE
TTGKIVFARI RRLYRQWVRE KLRKVRLKDG SSITITCRHK LFIRDSWKND FQVGDDVCVP
AKLMWDGKPE DPDVVKFVAW QVAEGWERVN SGMFGVSQKG KDVLEGLLEV FSRLGKRYDI
KINCPKVVAH GSKKNCYEFS AHSLEYRKFL EEKRYGWGKR SHEKTIPLFI MQADLDSVRV
FLSNYFDAEG WVNKTVRCVE ISTASSQLIQ ELSILLRRFG VWMKISPQQK CATNGTGVFC
TYYIGTFGGN SARCFLQEIG FNDSGKQENL KSICEKIADS NVEGIPASDI VAELVEKTQL
PVGSLGIQDP IYMDGCQDFS PTSLEKVINS IEDIISGAGE EEYGQLKSSK LRNKTLEAYS
LLNILELEIY KTRLQKLLNQ EVYYCQIESI EEMEYEGWVY DFEVSKYHNF VANNIICHNT
IQTIAFLLKQ QEQKALKGPT LLVCPTSVLG NWEREVKKFG PTLKAIVHHG DKRAKGKGFA
TAVKDTNLVI TSYALLHRDE KILETIKWQS VIVDEAQNIK NPEAKQSQAA RKLDASFRIA
LTGTPVENRL SELWSILDFL NPGYLGQKQF FQRRFAIPIE KYGDTSSLQI LRSLVQPFIL
RRLKTDKDII QDLPEKQENT IFCPLANEQA LLYQNIVENS LAEIDTVGGI QRKGKILALL
IKLKQLCNHP VLLQIKKGSR KKVEITDKNS GKLQRLGAML EEIISEEERA IIFTQFAEWG
KVLQPYLQKS LGREVSFLYG STQRSKREEM IDQFQLDPQG PPVMILSLKA GGTGLNLTRA
NHVFHFDRWW NPAVENQATD RVFRIGQTRN VQVHKFVCTG TLEEKIHDLI ESKKELAEQV
VGAGEKWLTE LDTDQLRNLL ILDRNQVIKE E