Gene Tery_3829 details

Gene Information

Locus tag: Tery_3829
Symbol:
ID: 4242280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5904039
End bp: 5905364
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 35%
IMG OID: 638108762
Product: hypothetical protein
Protein accession: YP_723345
Protein GI: 113477284
COG category:
COG ID:
TIGRFAM ID:
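
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5904039
end_bp = 5905364

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1326 441"
```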


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACGTT TAAATAGCCG AGCCTATAAA CTATTAAAAG TAGAAGTTGA CAAACGTGCC 
AAAGAAGATT TGGTTGGGGA AGTGGGAAAA AAAATAGTAA TGAAAAGACT GGATAAGTTA
CGATTTCAAT CAGGAGAGAC CCTAAAAGTA GAAGAATTAA GTTATATAGT AAAAGACCAA
TTTCCTAATT TTAGTGACAC AGTTATTCAG AAAGCAATTA AAATAAATAA GCAACCTAAA
ATATTAAGTA AAATTATGTG GATACCTGCT GCATTAGTGG GAATGGCAGG GATCGTTTGG
GTAGCAAATT TACCCTATCC CATGATTCGT AAACCTGTGT CTAGAGTAGC ACCAATAGTT
TTGCTACCAA GCTATATAAA TATGGATTAT AACTATAGAG AAGCGATCGC TAATATTGAA
CAAGCAGATC AACTAATTAA TAAAGCAACT TCCAGTGCAG ATATTAATTT AGGCGCTAAT
AAAGTTGCTT TAGCACAGAA GCATCTTAAT GAATTGCCAG TATGGTTTTT AGATTTTTAT
CCTAAAAGAT ATTGTACGTT TTTCAGTTGT TCTTGGAAAT TCACATTAGA TGAATTTGAA
ACCGCAAGAA AACAGGTAGG GCGAATGCAG GCAAAGGTAT TTCAAGAAAA CAATGCTCAA
ACATTCTTAG CGGAGGCGGA ACAAACAATT GCCACTGCTA AACAACAATA TCTGGAAGCT
ACAACCTCAA CAGAAAAACA GCAGGCGATC GCTAACTGGC AAAGTGGAAT AGATAGGTTA
GCACAAATTC CACCTACTAC TGTTGCTAAA AAAATAGCAC AACCACAATT AGCAGCAGCT
AAACGAGATT TTGAGAAAGA GGTAGGTTTA GCAGATGGTA ATAAACTCAG TTTCACTTTT
ATAGAAGTGG CAAAAAAATT TGGTTTAAAG GCGGCTGAAG CTGCTCAAAA TGGACCCCAT
TCAGATCAAC GATGGCAAGT AATAGTAGAT TTATGGGAGC AAGCAATTAA TCAATTACAG
AAAATATCAG CAGATAATCC TAGTTATTTG GCAGCTCAGA GCAAAATAGC AGAATATCAG
CTTAACCTCG CTAATGTGAA AATGCGATTA CAAGCAGAAA GGGACTCAGG AACAGCTTTT
AAGCGGGGTA AAGATTTAAT TGCAGATTGG AAAAAGTCTG CAATTAATAA TTCTGATCGA
GGTTTACTTG CTAGTAAGTT ACAAGAAATT ATTAATCAGT TAGAAAATGT AAAACCAGAA
ACAACTTTTT ATCAACAAGC TCAAGAATTA TTGAAATTTG CAAAAGATAA ACAGAAAAAT
TTATAA
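
A GC content figure like the one reported for this gene can be recomputed directly from the sequence text. The following is a minimal sketch, not the database's own method: `gc_fraction` is a hypothetical helper that strips the whitespace between the 10-base blocks and counts G and C bases. It is demonstrated on the first line of the gene sequence only.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used as a small demonstration.
first_line = "ATGTCACGTT TAAATAGCCG AGCCTATAAA CTATTAAAAG TAGAAGTTGA CAAACGTGCC"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full gene sequence (all lines joined) to the same function gives the value to compare with the GC content field of the record.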
 
Protein sequence
MSRLNSRAYK LLKVEVDKRA KEDLVGEVGK KIVMKRLDKL RFQSGETLKV EELSYIVKDQ 
FPNFSDTVIQ KAIKINKQPK ILSKIMWIPA ALVGMAGIVW VANLPYPMIR KPVSRVAPIV
LLPSYINMDY NYREAIANIE QADQLINKAT SSADINLGAN KVALAQKHLN ELPVWFLDFY
PKRYCTFFSC SWKFTLDEFE TARKQVGRMQ AKVFQENNAQ TFLAEAEQTI ATAKQQYLEA
TTSTEKQQAI ANWQSGIDRL AQIPPTTVAK KIAQPQLAAA KRDFEKEVGL ADGNKLSFTF
IEVAKKFGLK AAEAAQNGPH SDQRWQVIVD LWEQAINQLQ KISADNPSYL AAQSKIAEYQ
LNLANVKMRL QAERDSGTAF KRGKDLIADW KKSAINNSDR GLLASKLQEI INQLENVKPE
TTFYQQAQEL LKFAKDKQKN L
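
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions rather than the pipeline that produced this record: it builds the codon table from the compact amino-acid string of NCBI translation table 11 (bases in TCAG order), reads the sequence in frame from its first base, and stops at the first stop codon. `translate` is a hypothetical helper. Alternative start codons are not handled specially, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, third base varying fastest, in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCACGTT TAAATAGCCG AGCCTATAAA"))  # prints "MSRLNSRAYK"
```

The output matches the first ten residues of the protein sequence above. Translating the last twelve bases, `AAAAATTTATAA`, gives `KNL` followed by the stop codon, in agreement with the end of the protein sequence.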