Gene Tery_3463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3463 
Symbol 
ID4244463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5301467 
End bp5304766 
Gene Length3300 bp 
Protein Length1099 aa 
Translation table11 
GC content39% 
IMG OID638108438 
Producthypothetical protein 
Protein accessionYP_723027 
Protein GI113476966 
COG category[S] Function unknown 
COG ID[COG1649] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.639127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAC AAAAACAAAA AAAAGCTTTG TCTAAAAAAA ATTTACTACT TCTACCAAGT 
TTGTTAGGTG CATTACTAAG TTTAATGGTA CTGTTACCAA AAGCTGGTAG AGCTAGTGTT
TTATTAGGGA TAATTCGAGA CTTCCAGAAT ACAGATGAAT GGAATCAAGT TATTAGACGA
ATAGATGCTC TCGGTATTAG TTATGAACCA ATTGATTTAA GACAAATAAA AACAGTAGAT
GAGCTATCTG GGGTAAGGGT GATCTTTTTA CCAAATATTG AAGTTTTAAC TGAACCTCAA
GTACAAATTA TTGAAGAGTG GGTGAAGGGT GGAGGAAAAT TAATAGCTAG TGGTCAAATA
GGTCAAAAAT CTCAGTTAGG AGTAAGGCAA AAGTTGCGAT CGCTACTTGG TTCTTACTGG
GCTTTTCCTC TAAGTCAACC GACAATACCA GAACCCAAAT ATCGTTGTTT AGATCTAACT
TGTACAAAAT CTACAAATTG GGCACCAAAA ACCAATAATG TAGGTACAGT CACAGGTGGA
ATATTAATTC CAGCCGGTTT AAATAGCACA ACTGCAGCTA CTTGGAAGGG AACTTCTGGT
TCTTCAGCAG TAGTAATTAC TCCTCAAGTT ACTTATTTGG GTTGGCATTG GGGAAATACT
GAATCTGCTG CTGTGGATAG TATTTGGTTG CAGGCTATTT TGAATCGTTA TCAAGGTCAA
CCAGAATTTA GTGCCAGAAA TAATAATATT TTTTCTTTAG AAAATAATAG ACGGAGTGTA
TCTGATGCTA ATAACTCCAA CCCGGTAAAA ATTCACCCAA GAAGTAACCC TTCGCCGAGT
TCAAGGAAGG AAAGTGTGAG TCCAGTTAGA GTTCAACCTA AAAGGAATTC TGTAAATACT
GGGGAAAATT CAAATATAAA TCCAACTGCT CCTATAGCTG AAAATTCAAT TGTAAATGAA
GATAATATTT GGCTGAGAAG GCAGGAAAAG AAGAGCGGTA CTGAAAACGA ACCAGAAAAT
TTGATTCAAG CAAGGGAAGA ATTAGGTAGC CAAAATAGTA ATTTAGGATC AAAAATAACA
ACGGTTCCTG AAAGTGAAAT TGTTGGTGGG AAAGATACTG CTGTTAGTGA GTCTGAAAAT
TCAACTACAG AGACTACTAA TAATAGAGGT AGGAATATTT GGCAAAGGCT CCAACAAGAA
AAAGAAGAAC AAAATCAGGC TAGTGAAGTT GTTGGTGAGA AAGATCAAGT TATTGACTCA
AGTACTAACT CTACTCTACA AAATAATAAT AGGGGGAGTA TTTGGAACCG GACTCAAGAA
GAAACAAAAA CAAGGGCATC ATCTTCTCGC CGTCATCCTA TAATGCGCTT ACTCAGGTCT
CTACCACCAA TAGAAGTACC AAGTGCTCAA AGAGACCCTT CATCAGGATC AGCATCTCCT
GGTTTAGATA TTCGACAAGG AAATTATCCA ATTAGCAGGG CGAAAGCTTA TGCCATGTTA
GAGGAATTAA ATAATCTCCT TGGTAGATTT GAAAGCGCAT TGATAGCAGC TAAATCTGCA
AATGTAAAAG TTGATCTGGC AGCAGATGAT GTGAGTTTGT TGGCTGCGAG TACTGGCAAT
GCTAGGTTTA TTGCTCAAAG AAATCAAAAA ATTAGGGGTG GTCAACAAGT TATTGTTAAA
GTGCGTCAGG TAATTCAAAA TTTTCCCCAA CAGGTAAAAG CTAAACAATA CGCTGCTGCG
AGAAACCAAT GGCTACAAGC AAGGCAAATG TTGTGGAATA ACTACCCGAC TGATGGTCAA
AGAGCGGGAG CTGAGATTCG AGCTGTTTGG TTAGACCGGG GAACAATTGT GAGGGCGAGG
TCTGAAAGAG GTTTGGCTGG GGTATTTAAC CGACTTGCTG CTGCTGGTAT TAATACTGTT
TTCTTTGAAA CCATTAATGC TGGTTATACG ATTTATCCTA GTAATGTTGC TCCAAGACAA
AATCCTTTGA CAACTTCTTG GGATCCTCTG AAGGCGGCGG TGAAGTTAGC CCACGAAAGG
AATATGGAGT TACACCCTTG GATTTGGGCG TTTGCAGTGG GGAACAAAGC TCATAACCAG
GCTCTTGGTC AAGGAGATAG TTATTTGGGT CCGGTAATTT CGGCTCATCC TAGTTGGGTG
ATGACTGATA AAAGGGGTCG CAAAAGACAT CCTTTAGATG GCAAGGTTTA TATGGATCCT
GCGAATCCTG AGGTGAGGCA ATATTTGCTG AATATAATAG ATGAAATTGC TAGTCGGTAT
GAGGTTGATG GGATTCACCT TGACTATATT CGCTATCCTT TTCAAAATCC TGAACGGAAT
TTTTCTTATG GTTATAGTAC AATAGCGCGT AATCAGTTTC GGCAGTTGTA TGGGATAGAT
CCGATGAAGA TTTCGTCACG GGATCGCCAG AATTTGTGGA GGTGGACTGA GTTTAAGATT
AACCAAGTTA ATAGTTTTGT TGCTAATACT TCTAGTTTTC TCAAAAAGAA GTATCCAAGG
TTAATTTTTT CGGTAGCGGT GTTTCCTTTT CCTCGTCATC AACGCTTTGA TCAAATTCAG
CAAGACTGGG AAAGTTGGGT TATGAATGAG GATATTGATT TGTTGACTCC TATGACTTAC
GCTTTAGATA CAAATCGTTT TCAGCGAATA ACTCAACCGC TGACAAATAC TGGAGTGTTA
GGTAGTACTT TGATAACGCC GGCGGTTAAG CTTTTGAATA TTCCTGAAAT TGTAGCAGTA
GACCAAATTC AAGCAGCTCG AGATTTACCT ACTGGGGGTT ATATTATTTT TGCGGCGGAA
AGAATTACTG GTGGTTTCCA TGGATTTTTA ACTCGTACTC AAGGTGGGGT AGAAATGGAT
AATACTAGAA GAGCTTCATT AAATACTTTT GGTAAAACAG CTAAAGTTGT GGAGGGTGTA
ATTCCTTACC GTCAACCATT TATTGCTGCT GCAGATCGTT TTCAAGCTTT GAAAAAAGAA
TGGAGTTTTT TGTTGGGAAA TGAGCAACTT TTCCTGAGGG AGCTTCAGCT TGAGAGTTGG
GGTAATGAGG TAGGGGAGTT AGCAACAGCT TTGGAAAATT TGGCTGATAG TCCTGATCAT
AGTAATTTTA ATATTGCTAA GAGAAAGCTG AGGAAATTCC AGTTAAAATT TAGAAGTAAT
ATGAGTGATC ATGCTAGGCA AAATGCTTAT CAAGTTCAGA CTTGGCAAAA TCGTTTGACA
GCTTTGGAGA TGTTGTTAAA TTATGGGGAA AGGGTCAAGT TAAATGATCA GAGGTTTTAA
 
Protein sequence
MNQQKQKKAL SKKNLLLLPS LLGALLSLMV LLPKAGRASV LLGIIRDFQN TDEWNQVIRR 
IDALGISYEP IDLRQIKTVD ELSGVRVIFL PNIEVLTEPQ VQIIEEWVKG GGKLIASGQI
GQKSQLGVRQ KLRSLLGSYW AFPLSQPTIP EPKYRCLDLT CTKSTNWAPK TNNVGTVTGG
ILIPAGLNST TAATWKGTSG SSAVVITPQV TYLGWHWGNT ESAAVDSIWL QAILNRYQGQ
PEFSARNNNI FSLENNRRSV SDANNSNPVK IHPRSNPSPS SRKESVSPVR VQPKRNSVNT
GENSNINPTA PIAENSIVNE DNIWLRRQEK KSGTENEPEN LIQAREELGS QNSNLGSKIT
TVPESEIVGG KDTAVSESEN STTETTNNRG RNIWQRLQQE KEEQNQASEV VGEKDQVIDS
STNSTLQNNN RGSIWNRTQE ETKTRASSSR RHPIMRLLRS LPPIEVPSAQ RDPSSGSASP
GLDIRQGNYP ISRAKAYAML EELNNLLGRF ESALIAAKSA NVKVDLAADD VSLLAASTGN
ARFIAQRNQK IRGGQQVIVK VRQVIQNFPQ QVKAKQYAAA RNQWLQARQM LWNNYPTDGQ
RAGAEIRAVW LDRGTIVRAR SERGLAGVFN RLAAAGINTV FFETINAGYT IYPSNVAPRQ
NPLTTSWDPL KAAVKLAHER NMELHPWIWA FAVGNKAHNQ ALGQGDSYLG PVISAHPSWV
MTDKRGRKRH PLDGKVYMDP ANPEVRQYLL NIIDEIASRY EVDGIHLDYI RYPFQNPERN
FSYGYSTIAR NQFRQLYGID PMKISSRDRQ NLWRWTEFKI NQVNSFVANT SSFLKKKYPR
LIFSVAVFPF PRHQRFDQIQ QDWESWVMNE DIDLLTPMTY ALDTNRFQRI TQPLTNTGVL
GSTLITPAVK LLNIPEIVAV DQIQAARDLP TGGYIIFAAE RITGGFHGFL TRTQGGVEMD
NTRRASLNTF GKTAKVVEGV IPYRQPFIAA ADRFQALKKE WSFLLGNEQL FLRELQLESW
GNEVGELATA LENLADSPDH SNFNIAKRKL RKFQLKFRSN MSDHARQNAY QVQTWQNRLT
ALEMLLNYGE RVKLNDQRF