Gene Tery_0937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0937 
Symbol 
ID4245676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1471755 
End bp1473455 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content34% 
IMG OID638106192 
ProductWD repeat-containing protein 
Protein accessionYP_720804 
Protein GI113474743 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTATC TGAGAAAGAG GGCCCGCCCT TGGGTAAGTT CAAAATATAT CAAAATATTA 
AGTAAAATGA AAGTCGAAGA AGCATTGGAA GTTCTGGAGA CAGTTCTACC TCCAGGCTCC
TTAAACGCTG TAAAAAAAAT GGTATTTTCT CAAGCCTGGG AAGATAAAGG ATATTCTGAA
ATTGCCGAGC AAGCAGGTTA TGATCCAGAC TACATTAAAG GAGTAGCTGC TAACTTATGG
CAAAGTATTT CTAATGTCTT AGACGAAAAA GTAACCAAGA AAAATTTTCG CGCTCTGCTG
AGACAAAAAT TTGGCATTCA GAAATCATTT ATTGCCAAAA CCGAGCTAAA TACTCAACAG
CATCTAACTT CCCTGTCTTC TTGTGAAACA AATAAAATAG TATATAAATC AAAAGTAATT
GATTGGGGAG AAGCTATAGA TGTTTCTGTT TTTTACGGAC GCTCTCAAGA ACTCAATCAA
CTGCAAAAGT ATATTATCGC AGATGGTTGT CGCTTGATAG CCCTACTTGG TATGGGTGGT
ATCGGTAAAA CAGCGGTAGC AGCAAAAGTT GCTACACAAC TACAAAGTGA ATTTGACTAT
ATAATTTGGC GATCGCTGCG CCACTCTCCA CCACTAAAAA TAATGCTGAG AGAACTGATC
TCGTTTTTCT CTCACCAAAA ATGTACTCAA GGAGAACTAA GCAAACTTCT TGAATACTTA
CGCCAGTCAC GCTGTCTAAT AATTTTAGAT AGTGTCGAAA CTATTTTAAA AGCTGGATGT
ACAGGTTATT ACCGCTCTGG TTATGAAAAC TATAGTCAAT TATTTCAGTT AATCAGCGAA
ACATCTCACT CTAGCTGTCT TATTCTCACC AGTAGAGAAA AACTCCCAGA AGTAGCAGCC
CTTGAAAGTA TAGATACAGC AGTACGATCT TTGCAACTAT TTGGATCAAA AGAAATAGCT
AAAGCCTTAC TAGAAACTAG AGAAATATCA GGTTCAGAAG CACAAAAACA ACAACTTAGC
GAATATTATG GCTATAGTCC CCTAGCATTA AAAATAGTCA CTACCTCTAT CAAAGACTTA
TTCGATGGAG ACCTAAAAGA ATTTCTCCAA CATAATACTA CTACCTTCAA TGGTATTCGC
CGACTCCTCG ACCAACACTT TCATCGTCTT TCAGAACTAG AAAAAAAAAT TATGGTTTGG
TTAGCAGTTA ACCAAGACTG GACTAGTGTA CAAAAATTAG AAACTGATAT TGTGCCAGCA
ATTTCTAAAG TTAATCTCCT GGAAAGCTTA GAAGCTCTCA TCTGGCGCTC CATAGTCAAG
AAAAAATTAA GTATGTATAC AGTTGAACCT CTAGTAATGG AGTACATTCT TAACTACCTA
ATTGAAGAAG TTATTGGTGA ATTAATTACT ACTAACTTAA ACTTATTTGT TACTCATTCC
TTAATCATAA CTACTGAAAA CTCCTCTATT AAAGAACGAC AAAATAAGTT AATTATTGAA
CCAATTGCTA GACAACTAAG TAAAATATTT AGTTCTGATA AAACCCTCAA AAAACAATTA
TTATTAATCC TAAATAAGCT AGAAAGTAAC GAAATTTTAC CATGTGGTTA TGGCAAAGAA
AATCTTATTA ATCTTTCTAT TAAACTAGAG ATTGATTTAA TGAATATTGA TATTTATTCT
CATAAAATAG AATATAATTA A
 
Protein sequence
MIYLRKRARP WVSSKYIKIL SKMKVEEALE VLETVLPPGS LNAVKKMVFS QAWEDKGYSE 
IAEQAGYDPD YIKGVAANLW QSISNVLDEK VTKKNFRALL RQKFGIQKSF IAKTELNTQQ
HLTSLSSCET NKIVYKSKVI DWGEAIDVSV FYGRSQELNQ LQKYIIADGC RLIALLGMGG
IGKTAVAAKV ATQLQSEFDY IIWRSLRHSP PLKIMLRELI SFFSHQKCTQ GELSKLLEYL
RQSRCLIILD SVETILKAGC TGYYRSGYEN YSQLFQLISE TSHSSCLILT SREKLPEVAA
LESIDTAVRS LQLFGSKEIA KALLETREIS GSEAQKQQLS EYYGYSPLAL KIVTTSIKDL
FDGDLKEFLQ HNTTTFNGIR RLLDQHFHRL SELEKKIMVW LAVNQDWTSV QKLETDIVPA
ISKVNLLESL EALIWRSIVK KKLSMYTVEP LVMEYILNYL IEEVIGELIT TNLNLFVTHS
LIITTENSSI KERQNKLIIE PIARQLSKIF SSDKTLKKQL LLILNKLESN EILPCGYGKE
NLINLSIKLE IDLMNIDIYS HKIEYN