Gene Tery_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1784 
Symbol 
ID4242249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2724804 
End bp2725916 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content31% 
IMG OID638106908 
Producthypothetical protein 
Protein accessionYP_721516 
Protein GI113475455 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00387398 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATT ACAAATACAG ACCAAATCCT TATATTTTTG GCAAACCAAT TTATCAAAAA 
AAACAATTAT TTGGTAGAGA AAATACTGTT AAAAGAATTC AAAATAACAT TAAAAATAAT
ATAAAAATAA CTTTGTTGCA TATCCAAAGA CGTATTGGTA AAACTTCATT AATAACTTGT
TTGCCTCAGT TTTTCACTGA TGATATTAAG TGTGTTACTT TTTCATTTCA AGGTTATAAA
AATAGATCAA TACTTGAAAT CTTAAATAAT CTTGCTGATG AGATTGCTAA TACTATTGAT
GGAGTCCCTA GAGAAGTAAG AGAACTAGCT GATAGTTCAG AAAATTTTTT TCGGATTTTT
TTGCCAAGGA TTATTAATAA ATACTTATCT GGTAAAAACC TTGTTCTTCT CCTAGATGAA
TTTGATGTTT TAGAAGAAGA TCAAACAACG TTTATTCATG GAAAATACTT ATTTAATGAA
TTAAAAAAAG CTGTGAACCG AGAAGAAAAA CTATTTGCTA TCCTGGTTTT TGGTAGACCA
TTAAAAGATA TGACCTATCT AGGAGAATTG TTACAAAAAG AAGGTCAAGA GACTATAGAA
GTTGGTTTGC TGGATAAGAA AAGTACACAC AATTTGATTG TTGAGCCAGC TAAGGGAACA
TTAGAATATG AAGCAGATGC AATTGATGCT ATTTGGCAAC TATCCGCGGG TCATCCTTCT
CTAACACAAC TACTATGCTT GTATATTTTT AGGTTTTGTA TTCAAAAAGG AATAAAAAAG
GTAACTCATA CTCATGTCGA CTCAATTTTA GATGAAGCAA TGGAGGGAGG TAAGGCAGCA
TTAAAAGGCT TTATAGAACC TTTGAATGAA AACGAAAATT TGTTTTTTCG TGCAGTAGCA
AAAGCTCAGA ATGAAGTTGG AGAAAACCGA CTAAAGGCTA ATATAAAAAA ATCGCAGTCT
GTAGGGAAAC GTTTAGCTGA AGAATATGGC TTTTTAGAAG AAAAAGAAGA TGGAACAGGC
TATAAAATTA AAGTCGAGTT AGTCCGGCGT TGGCTAGTAA AAAATTATCC TTTATCTGAC
AAAGAAAAGC TGCAAGTGGA AAAAGCTATT TAA
 
Protein sequence
MNDYKYRPNP YIFGKPIYQK KQLFGRENTV KRIQNNIKNN IKITLLHIQR RIGKTSLITC 
LPQFFTDDIK CVTFSFQGYK NRSILEILNN LADEIANTID GVPREVRELA DSSENFFRIF
LPRIINKYLS GKNLVLLLDE FDVLEEDQTT FIHGKYLFNE LKKAVNREEK LFAILVFGRP
LKDMTYLGEL LQKEGQETIE VGLLDKKSTH NLIVEPAKGT LEYEADAIDA IWQLSAGHPS
LTQLLCLYIF RFCIQKGIKK VTHTHVDSIL DEAMEGGKAA LKGFIEPLNE NENLFFRAVA
KAQNEVGENR LKANIKKSQS VGKRLAEEYG FLEEKEDGTG YKIKVELVRR WLVKNYPLSD
KEKLQVEKAI