Gene Tery_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1066 
Symbol 
ID4241951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1667703 
End bp1670750 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content33% 
IMG OID638106296 
Producthypothetical protein 
Protein accessionYP_720908 
Protein GI113474847 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.619549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTATT CGATGAGCCC TAATACTGAA CCACAAATTA GTTATTTCCG AGAATGGACT 
AATAGCTGTG TTGATGACCA ATTAATCCAC CTTAACGTTG TCCCATTAGA AGGACAACGA
GCTTATGAAT TTTTATTTTA TTCTGATGCT ATTCCTCGAC GAAATGATGG TCGAGTCACA
AGCGAAATAT TAAACCGATA CCAACATATT GAAGAAGGTG GGTGGTGGTG TTCTGGAATT
GATTTATTAT CAGGAGAAGA AGATTTTTGG GGTTGTTTTA AACCTAGTCA ACCACGTCAT
AGTTACGACG AAAAAAAAAT AATTAAATAT GAGCACCCTC CCAAAACTCC TACTAGTTTA
TTTGCTCTAA AAATTCCCCT ACATTTATGG CATAAAATAG CGAGTCATTA TCAATTAACA
ATTTTGCCAG AAGATATTGA TAATAATCAA CCAGACTTTG GTTTTTGGCA GTGGTTTATC
GCCCATCCTC AAATACCTTT ATGTATTACT GAAGGGGCAA AAAAAGCAGG AGCTTTATTA
ACAGCATCCT ATGTAGCTAT TGCCTTACCA GGAGTATTTG GTGGATACCG AGTTTTGAGA
GATGAATATG GTAACCGTAT TGGTAAACAA CATTTAATTC CCCAGTTAGA AAAGCTGATT
AATAATACTC GAGAAATTTA TATTGCATTT GACCAAGATA CTAAAGCTAA AACTATTAAA
AATGTTAATG CTGCCATTAG AAAAACTGGA TATTTACTCA TAAAAAAGGG ATGTAAAGTT
AAAGTAATTT CCTGGAATCC AGAATTAGGT AAAGGAGTAG ATAATTTAAT CGCTAATCAT
GGAAAAAATG TTTTTGATGA AGCTTATAAA AAGGCATTAC CTTTAGAACT TTGGAAAGCT
AAATCATTTA TTCGTCTCAC ATATCCGGTA AATTTGAGAG TTAATAGTCG CTATCTTTCA
GAACAAAATA TATTTAATTC TATAGACAAT AATAACAATA ACTTACATAA ATTAGACGAC
ATTGATTTAG ATTATAGTAT AAATTTTCCC GCTAAACTTA TAGGAATAAA ATCTGCTAAA
GGAACTGGAA AAACTAAGTT TTTAGAAAAA ATAGTTTCTG AAGCTGTAGC TCGTAATCAA
AAAGTTTTAG TTATCGGGCA TCGGGTACAA TTAGTACAGG AATTATGTCA ACGTTTTGGA
TTAAAATATA TTACAGAAGT TAACTCAAAA TCCCCAGATA AATTATTAGG TTTAGGGTTA
TGTATTGACT CTTTACATCC TAATTCCCAA GCTAACTTTA ATCCAGAAAC TTGGTCAGAT
GGAATAGTAA TTATTGATGA AATTGAGCAA GTAATTTGGC ATGCTTTAAA TTCTAATACT
TGTAGGAAAA GTAGAGTAAA AATTCTCAGA TGTTTTAAAG CGTTAATGCA AAATATTTTA
GGGGGTGCAG GTAAAGTATT TATAGCTGAT GCTGACCTCA GCGATATTTC TATAGATTAT
TTACAAGCTT TAGCAGGAGT AAAATTAGAA CCTTTCATTG TTCAAAATGA TTGGTTACCT
GGAGAAAAAG AAGCTTGGAA AATTTTTAAT TACCCAGAAA CAACTCCCAA AAGATTAATA
GCAGATTTAC AAAAACATAT TCGTGAAGGT GGCAAACCAA TAGTTTGTTT ATCAGCGCAA
AAACTGACAA GTAAATGGGG GACTCGTGCC TTAGAAGCTT ATCTGAAAAA ACAGTTTCCC
AAATTAAAAA TTCTGAGAAT AGATTCAGAA TCTTTAGCAG AAGTAAATCA TCCTGCTTAT
GGTTGTATTA AGTCATTAAA TCAAGTATTA CTAAAGTATG ATATTGTTTT AGCTTCTCCC
TCAATTGAAA CTGGAGTTAG TATTGATGTT CAAGGGCATT TTACTTCAGT TTGGGCAATT
GCTCAGGGGG TGCAGGGAGC CACTTCTGTT TGTCAATCTT TGGGTAGAGT TCGTGAGAAT
ATTCCTAGAT ATTTGTGGGT AGCTAATTGT GGTTTTAATC AGGTAGGTAA TGGTTCTACT
TCTATAACTT CTTTGCTCAA TTCTGAGCAA CGTTTGACTC AATTAAATAT TCGGTTATTA
CAACAATCTG ATTTTGATAG TTTAGATGAT TTGGAAGTAG GTTTTCAGGC AGAATCTTTT
TTGTGTTGGG CAAAAATGGC GGTCCGCTTT AATGCTGGAA TGAATCAATA TAGAGAGTCA
GTTTTAGAAT TTTTACGAAT AGAAGGACAG CAGATCATAG AAGTTTCTGC AGAAGCTTTA
CCTGAAAATT TAGAAGAAAA AAAATCATCA GAAACTCCTG AAGAAATTAA TACTTTGCAG
GAAGCTATAG CTATTGTAAT TAAGCAAAAC TATCAGACAG AATGTGAGGC GATCGCTACT
GCGAAAAGTA TTAGTTTATT TGAGTATCAA AAACTCACAA AAAGATTATC AAAACCAATT
CAACAACAAC GAGAACAACG TAAATTTGAG TTGATGTTAC GTTATAGTAT TCCCATAACT
GCTGAATTAG TTCAGAAAGA TGATCGGGGG TGGTACCAAC AGTTGCAATT ACATTATTTT
ATGACAGTAG GGCGACCGTA TTTACCAGCC CGGGATGGGG AAGTAGTAAA AAAATTGTTG
GAGTTAGGTA AAGGTAATAT TTTTATTCCT GATTTTAATG ACTCTTTATT AGGTGCAATT
ATTGGAGTAA TAGAATTATT AAAAATACCT TCATTATTGA AGGATAAAAA ACGAGAGTTG
AAAAATATAG ACCCAGATTT ACAATTATTA GGAAAAACAG CTTTGTCAAA CCGAACAGAA
ATTAAAACTA TATTAGGAAT AGGATTAGCT GCTAACTCTA GCCCCATTAT AATTGTTAGG
CGTTTTTTGG AGAAAATTGG CTATAGTTTG GAATGTTTAC GAACAGAAAC TCACCATAAA
AAACGATTGC GAATTTATCG AATTTTTCAT CCTGATGATG GTAGGTTTGA AGTATTTCAG
CAATGGTTGA GCTCAAGCCA TAACTCAAAG GTCAGAACTT GTGTATAA
 
Protein sequence
MHYSMSPNTE PQISYFREWT NSCVDDQLIH LNVVPLEGQR AYEFLFYSDA IPRRNDGRVT 
SEILNRYQHI EEGGWWCSGI DLLSGEEDFW GCFKPSQPRH SYDEKKIIKY EHPPKTPTSL
FALKIPLHLW HKIASHYQLT ILPEDIDNNQ PDFGFWQWFI AHPQIPLCIT EGAKKAGALL
TASYVAIALP GVFGGYRVLR DEYGNRIGKQ HLIPQLEKLI NNTREIYIAF DQDTKAKTIK
NVNAAIRKTG YLLIKKGCKV KVISWNPELG KGVDNLIANH GKNVFDEAYK KALPLELWKA
KSFIRLTYPV NLRVNSRYLS EQNIFNSIDN NNNNLHKLDD IDLDYSINFP AKLIGIKSAK
GTGKTKFLEK IVSEAVARNQ KVLVIGHRVQ LVQELCQRFG LKYITEVNSK SPDKLLGLGL
CIDSLHPNSQ ANFNPETWSD GIVIIDEIEQ VIWHALNSNT CRKSRVKILR CFKALMQNIL
GGAGKVFIAD ADLSDISIDY LQALAGVKLE PFIVQNDWLP GEKEAWKIFN YPETTPKRLI
ADLQKHIREG GKPIVCLSAQ KLTSKWGTRA LEAYLKKQFP KLKILRIDSE SLAEVNHPAY
GCIKSLNQVL LKYDIVLASP SIETGVSIDV QGHFTSVWAI AQGVQGATSV CQSLGRVREN
IPRYLWVANC GFNQVGNGST SITSLLNSEQ RLTQLNIRLL QQSDFDSLDD LEVGFQAESF
LCWAKMAVRF NAGMNQYRES VLEFLRIEGQ QIIEVSAEAL PENLEEKKSS ETPEEINTLQ
EAIAIVIKQN YQTECEAIAT AKSISLFEYQ KLTKRLSKPI QQQREQRKFE LMLRYSIPIT
AELVQKDDRG WYQQLQLHYF MTVGRPYLPA RDGEVVKKLL ELGKGNIFIP DFNDSLLGAI
IGVIELLKIP SLLKDKKREL KNIDPDLQLL GKTALSNRTE IKTILGIGLA ANSSPIIIVR
RFLEKIGYSL ECLRTETHHK KRLRIYRIFH PDDGRFEVFQ QWLSSSHNSK VRTCV