Gene Tery_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0159 
Symbol 
ID4241752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp240062 
End bp241228 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content39% 
IMG OID638105507 
Producthypothetical protein 
Protein accessionYP_720126 
Protein GI113474065 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.322221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.596487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGGGG AAATCTGCCA AGTTGGCGAA GAAATCTTAA AGCTACTCCT AGATGAATTT 
CAGCAGTCTA CTCGTGGGTC AAGGCAAAAT TGCCGAGAGG TGGCAGAACG TATTACACAC
GAAGTAGATA GGATTTGCAC AGAAAGTAAA AGAATTCAAG CTTCGGGAGA AGTGGGTAAA
TGGGCTAAAA ATTTAGCTCT ACATCGCTTG AAACGATGTA TACATTACTA CCAGCTTCGT
TCTCAAGAAG GAAGAATAGA ATTACATAGC ACCTTCAGCG CTATTATTTA TAGATATATC
ACTCCTGCCC AAATACAATC AAGTTATCAG GCCAAATTAA ATCTGATAGA AGATTTTTTA
CAACAATTTT ACCTGGAAAC TTTGAATGCT TTTCGGCGAG AAAGTGAACT ACCAGCAACT
TATCGTCCCC GTACTTTGTT AGAACTTGCA GAGTATATGG CCTTTACCGA ACGTTATGGA
AAGAGACGTA TACCTTTATC TGGTGGTCGT AGTCAACAAT TGATTATTTT ACGGGCACAA
ACATTTTCCC AACAACAACC AAAGGAAACA TTTGTAGATA TTGACCAAGC AGCAGAAGGA
ACAACTACTG ACTCAGATAA GACTTGGAAC GATAGATCTA TCCAAGAAGT TCGAGAAGCA
ATGGTTGCAC AAGACCCAGG TAATAATATT GCTTCTTTGC GTCAGGTTGT GATTGAAGAA
CTAATGGCCT ATCTAGAGGA ACGTGAACAG AAAGACTGTG CAGATTACTT TGCATTGCGT
TTACAAGATT TATCAACTGG AGAAATAGAA TCTATCTTAG GTCTAACTCC CCGAGAAAGG
GATTATTTAC AGCAACGCTT TAAATACCAT TTGCTCAAAT TTGCTATGGG ACATCGTTGG
GAACTGGTTC ATCAATGGTT AGAAGCAGAT TTAGAACAAA ATTTAGGCTT AACTCCTACG
GAGTGGGAAG CCTTGCATCA CAAAATTGAT TCAGAGCAAA AAAATTTGCT AAAATTAAAA
CAACAAGGTA TTTCCGATGA TGTGATCGCG AAAACTTTAG GTCGTAAAAT CAACCAGGTT
AAGAAAAAGT GGTATAAATT ACTCGAACTT GCCTGGGAAT TACGAAATCG TTCAGGTTCC
GGAGCAGGGG CATCAAGTGA TGAATAA
 
Protein sequence
MMGEICQVGE EILKLLLDEF QQSTRGSRQN CREVAERITH EVDRICTESK RIQASGEVGK 
WAKNLALHRL KRCIHYYQLR SQEGRIELHS TFSAIIYRYI TPAQIQSSYQ AKLNLIEDFL
QQFYLETLNA FRRESELPAT YRPRTLLELA EYMAFTERYG KRRIPLSGGR SQQLIILRAQ
TFSQQQPKET FVDIDQAAEG TTTDSDKTWN DRSIQEVREA MVAQDPGNNI ASLRQVVIEE
LMAYLEEREQ KDCADYFALR LQDLSTGEIE SILGLTPRER DYLQQRFKYH LLKFAMGHRW
ELVHQWLEAD LEQNLGLTPT EWEALHHKID SEQKNLLKLK QQGISDDVIA KTLGRKINQV
KKKWYKLLEL AWELRNRSGS GAGASSDE