Gene Tery_4049 details


Gene Information

Locus tag: Tery_4049
Symbol:
ID: 4242077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6256462
End bp: 6258087
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 37%
IMG OID: 638108953
Product: hypothetical protein
Protein accession: YP_723534
Protein GI: 113477473
COG category: [S] Function unknown
COG ID: [COG3463] Predicted membrane protein
TIGRFAM ID:
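
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon not counted in the protein length. Neither convention is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6256462, 6258087)
print(length)                  # 1626
print(protein_length(length))  # 541
```

Under these assumptions the computed values agree with the Gene Length (1626 bp) and Protein Length (541 aa) fields.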


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.49668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.205664
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGTTA GTAGGCAAAA AAATCAGTTC CAAATAAAGC TATTTGTGGT AGCGATCGCC 
TTTTTTGTAG TTTGTCTGTT GTTCAACCTA CACCGTTATT ACAGTTTTTA CGCCTCTTAC
GACCAAGGTA TATTTAACCA AGTCTTTTGG AATAGTATGC ACGGTCATTT TTTTCAAAGT
TCCCTCTCTT CAGCTCTTTC CACGAATGTA GTTCATCAAG GCCAAGTCTC AGAAGTATAT
TATCACCGCT TAGGCCAACA TTTTACCCCG GCTCTTTTGC TTTGGTTGCC AATATATGTC
TTATTTCCAT ATCCTATTAC TCTAACAGTC TTACAGGTAA CATTAATTAG TGTTTCGGGC
TTAGTTCTAT ATATCCTGGC TAGACAGCAT CTCCAACCAC AGTTATCAGC CATGATGAGC
ATTAGTTTCT ATGGAGCTAA TGCAGTAATA GGCCCTACCC TAGCCAATTT TCATGATATT
TGTCAAATAC CTTTGTTTGT ATTTAGCTTG CTATTGGCCA TGGAAAAACG CTGGTGGTCT
CTATTTTGGA TTTTAGCAGT ATTAACTTTA GCTGTTCGAG AAGATTCAGG AGTAGTATTA
TTTGGAGTCG GATTCTACTT AATCTTAAGT AAACGCTATC CCAAAATAGG TTTAGCTATC
TGTACTCTCA GTTTTTCCTA TATGATAGCT CTAACCAATC TTATTATGCC AGTGTTTTCT
GAAGACATTT CCCAGAGATT TATGATGGAA AGATTTGGTC AATATACAGA AGGAGACCAA
GCTTCTACTC TAGAAATTAT TTGGGGCATG ATGAGCAACC CTTGGCGCTT GATCAAGGAA
ATTTTTACTC CATTTTTTGG TACTCTCAAA TATTTATTAG GTCAATGGTT ACCTCTAGCA
TTTGTGCCTG CTGTTGCTCC TGCAGCTTGG ATGATAGCAG GATTTCCTTT GCTGAAACTA
TTTTCTGCTA AGGGTTTGTC TGTACTGTCA ATTACAATTC GTTATGCTAT GACAGTAGTG
CCAGGATTAT TTTATGGTGC AATTTTATGG TGGGGAAAGA GAGAATCAGA AGTTAAGAAT
TCACCAGTAC CATCACCAAA ATTTCTTCGT TTCTGGGTAG TTTGTATTTG CCTGTCACTA
TTTTTTACAT TTACATCTAA TCCTAATCGC ACCTTCTATT TTTTAGTACC AGATTCAGTT
AAACCTTGGG TTTATATACC AGCAAACGAA CAATGGCAGC ATGTTAGTCA AATGCGACCA
TTGTTAGATA AAATTCCTGA TGATGCTAGT GTCGCAGCAA CAACTTATAT TATTCCACAT
TTATCAAGTC GTCGAGCAAT TTTGCGATTT CCCAGAATGC AATTTCGCAA TGATGCTAAA
AAGATTGAAA AAGTACAATA TATTATTGTT GACCTGTGGC GGTTGAATAG GTATCGGGTA
GCTTTTAAGA GCGATCGCCA ACGACTAGAA ACAATTGTTC CTCGAATTGA AGAACTTTAT
AATTCTGGTG AATATGGAAT TACTGACTTT GGAGATGGTG TAGTTTTACT AGAAAAAGGA
GTTACTTCTA ATCTTAATGC TGTTGATGGA TGGCAAAACT TCCGCCAAGA AATTGCTCAA
TCTTAA
 
Protein sequence
MLVSRQKNQF QIKLFVVAIA FFVVCLLFNL HRYYSFYASY DQGIFNQVFW NSMHGHFFQS 
SLSSALSTNV VHQGQVSEVY YHRLGQHFTP ALLLWLPIYV LFPYPITLTV LQVTLISVSG
LVLYILARQH LQPQLSAMMS ISFYGANAVI GPTLANFHDI CQIPLFVFSL LLAMEKRWWS
LFWILAVLTL AVREDSGVVL FGVGFYLILS KRYPKIGLAI CTLSFSYMIA LTNLIMPVFS
EDISQRFMME RFGQYTEGDQ ASTLEIIWGM MSNPWRLIKE IFTPFFGTLK YLLGQWLPLA
FVPAVAPAAW MIAGFPLLKL FSAKGLSVLS ITIRYAMTVV PGLFYGAILW WGKRESEVKN
SPVPSPKFLR FWVVCICLSL FFTFTSNPNR TFYFLVPDSV KPWVYIPANE QWQHVSQMRP
LLDKIPDDAS VAATTYIIPH LSSRRAILRF PRMQFRNDAK KIEKVQYIIV DLWRLNRYRV
AFKSDRQRLE TIVPRIEELY NSGEYGITDF GDGVVLLEKG VTSNLNAVDG WQNFRQEIAQ
S
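
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is written out by hand in the usual T, C, A, G ordering. Translation table 11 assigns codons to amino acids in the same way as the standard code (it differs in which start codons are permitted), so the standard assignments are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in T, C, A, G codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups of the gene sequence listed above.
first_block = "ATGTTGGTTA GTAGGCAAAA AAATCAGTTC"
print(translate(first_block))  # MLVSRQKNQF
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein and the reported GC content to be checked in the same way.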