Gene Tery_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1764 
Symbol 
ID4242607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2685686 
End bp2687011 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content39% 
IMG OID638106888 
Producthypothetical protein 
Protein accessionYP_721497 
Protein GI113475436 
COG category[S] Function unknown 
COG ID[COG3395] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACTA AAAAACCAAA AATTATTGTT ATAGATGACG ATCCCACAGG TTCCCAAACT 
GTTCATAGCT GCCTACTCCT AACCAAGTGG GATATAGAAA CCCTCAAATT AGGATTACTT
GACGAATCTC CCATATTTTT TATCCTCTCA AATACTCGCG CTCTGACTCC CGAACAAGCA
GCAACTGTAA CACGAGAAAT CACACAAAAT CTGACAGAAG CGATCGCTCA ATCAAACATT
CAAGATTTTT TAGTTGTCAG TCGTTCCGAC TCTACTCTAC GGGGTCACTA CCCAATAGAA
ACAGATACTA TTGCAGCAGA ACTAGGTCAG TTTGATGCTC ATTTCCTAAT TCCAGCATTT
TTTGAATGTG GAAGAATTAC CCGTAACAGT ACCCACTATT TAATAGTCAA TGGTGTAGAA
ACCCCAGTTC ACGAAACAGA ATTTGCCAAA GACTCTGTAT TTGGCTACAC AAGTAGTTAT
TTGCCTGATT ATGTGGAGGA AAAAACTCAA GCTCAGATCA AAGCAGAAAT TGTAGAAAAA
TTTACACTTA ATGACATTCG TTCTGGCAGT TTAGAACGTT TAATGAAACT GACTGGCAAC
CAATGTGGCG TTGTAGATGG AGAAACTCAA GCAGACTTAG ATAGATTTGC CAATGATTTA
TTGGCAGCAG CAAGTCAAGG TAAAAAGTTT TTGTTGCGCA GTGCTGCCAG TATTTTAACT
TCCCTCACTG CTTTGAGTTC TCAACCTGTA GCTGCCGAGG AAATGAGACA ATATGTCAGA
GGTGGGAAAC CAGGTGTAAT TATTGTTGGT TCTCACGTGA AGAAGTCTAC TCAACAGTTG
GAAAGGTTAT TACAAGAATC TGAAGTAGTA GGTGTAGAAG TAGATGTATC TCATTTAGTT
GAAGACTTTC AAGAGCAAAG AGCCACTTTA CTAAAAAACA TTCTCGAAAA AGTTTCTGCT
GTTAATACAG AAGGGAAAAC AACAGTTGTT TATACAAGTC GCAAAGAGTT GACTTTTGAA
AATGTGCAGG TGCGTTTAGA GTTTGGTGTA GCAGTGTCGG AATTATTAAT GGATATTGTT
CGGGGTCTGC CAGAGGATAT TGGATTTTTA ATTAGTAAAG GTGGAATTAC TTCTAATGAT
ACCTTGAGTA AAGGTTTAGC TTTAACTACT GCCAGGTTAT TAGGTCAGGT TTTAGAAGGT
TGTTCAATAG TGCGGACTCC AGATTATCAT CCTCAGTTTC CTGAATTACC TGTGGTATTA
TTTCCAGGGA ATGTTGGAGA TGTAGATGGG TTAGTAACAG TTTATCAGCG TTTGAGTGGA
AAGTAA
 
Protein sequence
MTTKKPKIIV IDDDPTGSQT VHSCLLLTKW DIETLKLGLL DESPIFFILS NTRALTPEQA 
ATVTREITQN LTEAIAQSNI QDFLVVSRSD STLRGHYPIE TDTIAAELGQ FDAHFLIPAF
FECGRITRNS THYLIVNGVE TPVHETEFAK DSVFGYTSSY LPDYVEEKTQ AQIKAEIVEK
FTLNDIRSGS LERLMKLTGN QCGVVDGETQ ADLDRFANDL LAAASQGKKF LLRSAASILT
SLTALSSQPV AAEEMRQYVR GGKPGVIIVG SHVKKSTQQL ERLLQESEVV GVEVDVSHLV
EDFQEQRATL LKNILEKVSA VNTEGKTTVV YTSRKELTFE NVQVRLEFGV AVSELLMDIV
RGLPEDIGFL ISKGGITSND TLSKGLALTT ARLLGQVLEG CSIVRTPDYH PQFPELPVVL
FPGNVGDVDG LVTVYQRLSG K