Gene Tery_4979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4979 
Symbol 
ID4246634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7604596 
End bp7606332 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content37% 
IMG OID638109790 
ProductWD-40 repeat-containing protein 
Protein accessionYP_724366 
Protein GI113478305 
COG category[O] Posttranslational modification, protein turnover, chaperones
[R] General function prediction only 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.363391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTAA AGAAAACAAT TGTCATAATT ATTATTAGCA ACTTACTCAA TATAAACACA 
ACCAATATCA GGAATTTTAT TGTTGCTATT ACCATCACTA ATTCACTAAT AACACAAAAA
ATTGCTCTGG GATCAACAGC ATCAGAAATT AATAAAATTG CAGCCGAAAT TACAGTTAGA
ATTGACGGAA GAAACAGAAG TGGCTCAGGA GTAATTGTTG AAAAACAAGG AGACACATAT
TATGTACTCA CCAACTGGCA CGTTGTTAAT ACAGTAGGAG ATTACCAAGT ACGTACACCT
GATGGAAAAA TGCACCCTGT TTACTATACC TTAATCAAGC AATTGCCAAA TATAGACTTA
GCCATCATAC CATTTAGCAG TTCTCAAAAC TACCCCATTG CCGAAACAGG AGACTCCACT
AAATTAGTTT CTGGAACTAA GGTATTTGTG GGGGGATGGC CTCGTTCTGG TAGCAATCTC
AAGCAACGAA TTTTTCTGAG CACCCCAGGA GTGGTGAAAG GTCGTCAGCA ACCTGTTGCT
GGTTATAGTT TACTATATGA TAATTTAGTC AGAGCGGGGA TGAGTGGTGG ACCTGTACTT
AATGAAGAAG GTCGCTTAGT AGCTATTAAT GGTATTGTCA AATTGCAAGA AAACTCTGAT
GTCATTGTCT CAGGAGGGAT TGAAATTAAT ACTTTTTGGG ACTGGCGACA AGGAGTTTCA
TTGCCCATTG TTTCTCAAAT ACCTGAAAAT ATCCAAGCTC CAGCAAGTCC AACAGAAACA
ACAAATCCTG TTGCTACAAA TACTGCTACC AGAGAGGACG ATACTGAAAC TGTCCTGGAC
TTTACTCTCG CAAAAGCAAT TACTGATGAA ATTAGTGGAA TAGTTAATTC AATAGTTGTT
CTAAATGCTT ACATCGTTAT GGGAAGCAGT AATGGTATGA TTTCAGTTTG GGATATCGAG
AATAGAGAAA TTATAGCTAT CTGGAAAGCA CATCCAGAGT CAGTAAACTC AGTTGCAGTA
ACTCCTGATG AACAATTTGT TATTAGTGGA AGTGATGACA AAACTATTAA AATATGGAAA
CTACCTAAAA ATAAAAATAT CAATGATATT AGCTTGGTGC AAACTCTCAC AGGACATACT
GATGTGGTAG ATGGAGTTGC GATCGCTCCC AATAGTAAGA TTTTTGCTAG TGGGAGTTGG
GATGGGACTA TTAAAATTTG GAATTTGGCT AGCGGGGAGT TGCTGCAAAC AATAGCCGGA
CATTCTGAGA TAGTCAATGG GATCGCCATT AGTCCAGATG GGCAATTTTT AGCTAGTGGT
AGCAAGGATA ATCAGATTAA ATTGTGGAAT TTGCAGACAG GACAACTTGT TCGTACTATT
AATACTAATT CTGTTTCAAT TTTGTCTGTA GTTTTTAGTC CTGATAGTCA AATCTTAGCT
AGTAGTAGTA GTAATGGCAC AATTAATATT TGGAACTTAC AGACAGGTAA ATTAATCCAT
AATTTAAAAG AACATTTAGA TGGAGTTTGG TCAATAGTTA TTACTCCTGA TGGAAAAACT
TTAATTAGTG GTAGTTGGGA TAAAACAATC AAATTTTGGG AACTGAGTAC AGGTAAATTA
AAAGGAAGTT TAAGGGGACA CAACAGTTAT ATTAGTGTCG TAGCAATTAG TCCTAATGGG
CAAATCATCG TTAGTGGTGG TTGGGACCGG AAGATTAATA TTTGGAAAGC TCCCTAA
 
Protein sequence
MNLKKTIVII IISNLLNINT TNIRNFIVAI TITNSLITQK IALGSTASEI NKIAAEITVR 
IDGRNRSGSG VIVEKQGDTY YVLTNWHVVN TVGDYQVRTP DGKMHPVYYT LIKQLPNIDL
AIIPFSSSQN YPIAETGDST KLVSGTKVFV GGWPRSGSNL KQRIFLSTPG VVKGRQQPVA
GYSLLYDNLV RAGMSGGPVL NEEGRLVAIN GIVKLQENSD VIVSGGIEIN TFWDWRQGVS
LPIVSQIPEN IQAPASPTET TNPVATNTAT REDDTETVLD FTLAKAITDE ISGIVNSIVV
LNAYIVMGSS NGMISVWDIE NREIIAIWKA HPESVNSVAV TPDEQFVISG SDDKTIKIWK
LPKNKNINDI SLVQTLTGHT DVVDGVAIAP NSKIFASGSW DGTIKIWNLA SGELLQTIAG
HSEIVNGIAI SPDGQFLASG SKDNQIKLWN LQTGQLVRTI NTNSVSILSV VFSPDSQILA
SSSSNGTINI WNLQTGKLIH NLKEHLDGVW SIVITPDGKT LISGSWDKTI KFWELSTGKL
KGSLRGHNSY ISVVAISPNG QIIVSGGWDR KINIWKAP