Gene Tery_4989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4989 
Symbol 
ID4246644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7626517 
End bp7630248 
Gene Length3732 bp 
Protein Length1243 aa 
Translation table11 
GC content47% 
IMG OID638109800 
ProductNB-ARC 
Protein accessionYP_724376 
Protein GI113478315 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.700423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGCTA TCCACGGTTT GGGTGCTGTG GGTAAGTCTA CTCTGGCAAC CGCTTTGGCT 
TATGATGGGG ATGTACAATC TCGTTTTTGT GATGGTATTC TCTGGGTGAC TTTGGGCCAA
GAGCCTAATA TTTTGCCTAT GCTTGGTCGT TGGGTCCAGA AGTTGGGTGA CTATGATTTT
AAACCGACAA GTGTGGAGGT GACTGTTAAT CATCTGCGGA TTTTGCTTTC TGATAAGGCT
GTGCTTTTGG TGGTGGATGA TGCTTGGAAT TCGGACCATG CTCAAATATT TAATCTGGGT
GGTCCCCGTT GTCAGGTTTT GGTGACTACT CGGGAAAGGG CGATCGCTGA GGCTTTGAGT
GCTAAGACTT ATAGTCTGGA TGTGATGACC TTGGAGCAGT CGATGTTGTT GTTGAGTGGG
AGGTTGGGCC GTGAGATTTG GGGAGAGGAG GCTCGGCAGG CGGAGGCTTT GGTTCAGGAG
TTGGGATATT TGCCTTTGGC TTTGGAGTTG GCGGCGGCTC AAGTTGAGGA TGGTCTGTCT
TGGGGTGTGC TGTTGGAGGA TATTCAAGGA GAAGTTGCTC GGTTGAGAAC GTTGGATCGA
CCGGGGGCGA GGGATGTTGT TGATGAGGCT AGTTTAAAAC GGTTGAGTTT GACTGCTTCG
TTGAATTTGA GTCTTAAGAG ACTGGAGCCG GAAACTAGGG AGGGTTTGAT TTGGTTGGGG
ATATTGCCGG AGGATGTGAA TATTACTCAG GGGATGACGG CTGTACTCTG GGATATGGAT
GATGAGCGGG ATGCCAGGGA TGAGTTGGGG TATTTGCGGA GTAAGGCGTT GTTGTTGGAT
GGTGTGGCTT TGTCTGATGG TAAGAAGAGT TATCGTTTAC ATGATTTATT TCATCATTTG
GCTCGGAATT TGTTGAGTGC TCCCCTAAAG CCAAGGGGTA GGGGTGCTCT GGCGGGGTTG
GGTATTGGTC TGGCTGAGGC TCACGGAATA TTTTTGGAGA AATATAGGAG GTTAACTGAT
AATTATCTTT GGCATACTTT GCCTGATGAT GGTTATATTC ATCAGCATCT GGTCTGGCAT
TTTGAAAGGG CGGGGATGAT AGGGGATATT CATGGGTTAT TGGGGGAGGA GTCGAAAAGT
GGGGCTAATG GTTGGTATGA AACCTGCGAT GGTTTGGGGC GAATTGGGAT TTTTATTACT
GATGTGGCTC GTGCTTGGGA GTTAGCAGAG GTTGACTGGG ATGAAGGTCG GTTGGCTCAA
GTTGTTGGTT GGCAGTGTCG TTATGCTTTG ATCACTGCTT CTATTAATAG TTTGGCAGCG
AATTTACGAA GGGAGTTATT GGTGGCTTTG GTTAAAAACA ATGTATGGAG TCCTGAGCAA
GGGTTGGCTT ATGCATTACA AAAGCCAGAT CTACTGGATA AAGTAAAATC TCTGGTAATG
TTAGTTGATT ATTTGCCAGA AAACTTCAAA AAACAGGCGC TTTCAGAAGC ACTGGCTGCT
GCTCGACAGA CTCAGGATGA AGACGATCGC GCCAAAGCTC TCAGTGTTTT GGCTAAGAAA
TTGACACCAG AGTTATTACC AGAAGCTCTG GCTGCTGCTC GACATATTCA GTATGAAGAC
GATCGCGCCA AAGCTCTCAG TGCTTTGACT GATAAGTTAC CAGAGTTATT ACCAGAAGCT
CTGGCTGCTG CTCGACATAT TCAGGATGAA GACGATCGCG CCCATGCTCT CAGTGCTTTG
GCTGAGAACT TGACACCAGA ATTATTACAA AAAGCTCTGG ATGCTGCTCG ACGTATTCAG
TCTGCATGCT ATCGCGCCCA AGTTCTCAGT GCTTTGGCTG AGAAATTGAT GCCAGAATTA
TTACCAGAAG CTCTGGCTGC TGCTCGACAG ATTCAGGATG AAGACGATCG CGCCCATGCT
CTCAGTGCTT TGGCTGATAA ATTGACACCA GAATTATTAC CAGAAGCACT GGCTGCTGCT
CGACAGATTC AGGATGAAGA CGATCGCGCC CATGCTCTCA GTGCTTTGGC TGATAAATTG
ACACCAGAAT TATTACCAGA AGCACTGGCT GCTGCTCGAC ATATTCAGGA TAAATACGAT
CGCGCCCAAG TTCTCAGTGC TTTGGCTGAT AAATTCCCAG AATTATTACC AGAAGCACTG
GCTGCTGCTC GACATATTCA GGATAAATAC GATCGCGCCC AAGTTCTCAG TGCTTTGGCT
GATAAATTCC CAGAATTATT ACCAGAAGCA CTGGCTGCTG CTCGACATAT TCAGGGTGAA
ACACATCGCG CCCAAGTTCT CAGTGCTTTG GCTAAGAAAC TGACACCAGA ATTATTACCA
GAAGCTCTGG CTGCTGCTCG ACATATTCAG CATGATAAAT ATCGCGCCCA AGTTCTCAGT
GCTTTGGCTG AGAAATTCCC AGAATTATTA CCAGAAGCTC TGGCTGCTGC TCGACATATT
CAGTCTGAGC GGGGTCGCGC CCTAGTTCTC AGTGCTTTGG CTGAGAAATT ACCAGAATTA
TTACCAGAAG CTCTGGCTGC TGCTCGACAT ATTCAGGATG AATACTATCG CGCCCTAGTT
CTCAGTGCTT TGGCTGATAA ATTCCCAGAA TTATTACCAG AAGCTCTGGC TGCTGCTCGA
CATATTCAGT CTGAGCGGGG TCGCGCCCAC GTTCTCAGTG CTTTGGCTGA GAAATTCCCA
GAATTATTAC CAGAAGCTCT GGCTGCTGCT CGACATATTC AGGATGAATA CTATCGCGCC
CAAGTTCTCA GTGCTTTGGC TGAGAAATTC CCAGAATTAT TACCAGAAGC TCTGGCTGCT
GCTCGACATA TTCAGGATGA ATACTATCGC GCCCAAGTTC TCAGTGCTTT GGCTGAGAAA
TTCCCAGAAT TATTACCAGA AGCTCTGGCT GCTGCTCGAC ATATTCAGGA TGAATACTAT
CGCGCCCAAG TTCTCAGTGC TTTGGCTGAG AAATTACCAG CAGAATTATT ACCAGAAGCT
CTGGCTGCTG CTCGACATAT TCAGTCTGAA CACTTTCGCG CCCACGTTCT CAGTGCTTTG
GCTGAGAAAT TACCAGCAGA ATTATTACCA GAAGCTCTGG CTGCTGCTCG ACATATTCAG
TTTGAGGAAG CTCGCGCCAA AGTTCTCAGT GCTTTGGCTG AGAAATTACC AGAATTATTA
CCAGAAGCTC TGGCTGCTGC TCGACATATT CAGTCTGAGT GGGATCGCGC CCAAGTTCTC
AGTGCTTTGG CTGAGAAATT ACCAGCAGAA TTATTACCAG AAGCTCTGGC TGCTGCTCGA
CATATTCAGT CTGAGTGGGA TCGCGCCAAA GTTCTCAGTG CTTTGGCTGA GAAATTACCA
GAATTATTAC CAGAAGCTCT GGCTGCTGCT CGACATATTC AGTCTGAGTG GGGTCGCGCC
CAAGTTCTCA GTGCTTTGGC TGAGAAATTA CCAGAATTAT TACCACAAGC ACTGACTATT
GCTCAAGGGA TTAAGGATAA GGAATCTCGC GCCCAAGTTA TCGTTGCTTT AGCTGATAAA
TTGACACAAA TGCCAAAAAC TGAACTTTAC CCACTCTGGC AAGACACCCT TCACGCCCGA
TCCCTCCGCA CTCGCCGCGA CTTACTCTTA GACATAAGAA AACTAACTCC CGTCATCTTT
TATTTAGGAG GTCAAGAAGC AATCAAAAAT ACAGCCATTG CTATTCAAGA AATTTCCCGG
TGGTGGCCTT GA
 
Protein sequence
MTAIHGLGAV GKSTLATALA YDGDVQSRFC DGILWVTLGQ EPNILPMLGR WVQKLGDYDF 
KPTSVEVTVN HLRILLSDKA VLLVVDDAWN SDHAQIFNLG GPRCQVLVTT RERAIAEALS
AKTYSLDVMT LEQSMLLLSG RLGREIWGEE ARQAEALVQE LGYLPLALEL AAAQVEDGLS
WGVLLEDIQG EVARLRTLDR PGARDVVDEA SLKRLSLTAS LNLSLKRLEP ETREGLIWLG
ILPEDVNITQ GMTAVLWDMD DERDARDELG YLRSKALLLD GVALSDGKKS YRLHDLFHHL
ARNLLSAPLK PRGRGALAGL GIGLAEAHGI FLEKYRRLTD NYLWHTLPDD GYIHQHLVWH
FERAGMIGDI HGLLGEESKS GANGWYETCD GLGRIGIFIT DVARAWELAE VDWDEGRLAQ
VVGWQCRYAL ITASINSLAA NLRRELLVAL VKNNVWSPEQ GLAYALQKPD LLDKVKSLVM
LVDYLPENFK KQALSEALAA ARQTQDEDDR AKALSVLAKK LTPELLPEAL AAARHIQYED
DRAKALSALT DKLPELLPEA LAAARHIQDE DDRAHALSAL AENLTPELLQ KALDAARRIQ
SACYRAQVLS ALAEKLMPEL LPEALAAARQ IQDEDDRAHA LSALADKLTP ELLPEALAAA
RQIQDEDDRA HALSALADKL TPELLPEALA AARHIQDKYD RAQVLSALAD KFPELLPEAL
AAARHIQDKY DRAQVLSALA DKFPELLPEA LAAARHIQGE THRAQVLSAL AKKLTPELLP
EALAAARHIQ HDKYRAQVLS ALAEKFPELL PEALAAARHI QSERGRALVL SALAEKLPEL
LPEALAAARH IQDEYYRALV LSALADKFPE LLPEALAAAR HIQSERGRAH VLSALAEKFP
ELLPEALAAA RHIQDEYYRA QVLSALAEKF PELLPEALAA ARHIQDEYYR AQVLSALAEK
FPELLPEALA AARHIQDEYY RAQVLSALAE KLPAELLPEA LAAARHIQSE HFRAHVLSAL
AEKLPAELLP EALAAARHIQ FEEARAKVLS ALAEKLPELL PEALAAARHI QSEWDRAQVL
SALAEKLPAE LLPEALAAAR HIQSEWDRAK VLSALAEKLP ELLPEALAAA RHIQSEWGRA
QVLSALAEKL PELLPQALTI AQGIKDKESR AQVIVALADK LTQMPKTELY PLWQDTLHAR
SLRTRRDLLL DIRKLTPVIF YLGGQEAIKN TAIAIQEISR WWP