Gene Tery_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2059 
Symbol 
ID4245707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3212172 
End bp3213308 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content31% 
IMG OID638107170 
Producthypothetical protein 
Protein accessionYP_721773 
Protein GI113475712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.3774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGA TTGAATCTAT TGAAAAACAA ATTAACAATA CTAAAAAAAA TCTACTTCTG 
ACTGAGGAAA GAATATCTGA ATATCCTGAG ACTACGAGTG TTCCACTAAA AGACATCAAG
AATAAACGAA AGTTAGAGAC TAGGTTAAAA AAATTGAGAG CTCAATTAAT AGGTCTTGAT
AAGCATCGCG ATCGCTTAAA AGAACGTCGT GATATTCCTC CTTTACTTCC TTACTTTGTT
AATCGTAGTA GTCAACAAAT TGAACTTAAA GAAGCATTAA AAAAATTGTT CAATCATCCT
TATTCTCAGC CACTTTTATG TATAATTCAC GGGGATAAAT TACAATGTCA TGATACATTT
TTTCAATTTC TTGAAAAGGA GTTTCTGCCA AAAATATTCA AGGAAAATAC CAATATAAAT
ACAAATAATA ACTATACAGA ATTTATCAAA GAATACCAAC TGAAATGGCC GCCGAGAAAT
ACTTCTATAG AAGACTTAAA AAATAGACTA GAACACAATT TAGTAGATAC GGTACTTGAC
AATAAATTTG CTTCAAAAGC AGACATAAAT GAGTACCTGG CTAGTATTGG AACAGTTATT
ATTCACACAC ATTTGCTCAC TGCTGATTTG CAGGAAAATT CATCTATAAT TATTGAAAAA
TTTATAGAAT TTTGGCAAGA TTGGCCTGAA CTTATTCCTA ATCAACATCT AATTATTTTC
TTGTTTATTA AATACAAAAT TGAACCAAAA CAAAACTATT TTCAGAAACT ACAGAATTGG
ATTTTTCAGA ATCTATTCAA GAATAATTCC AACACTCGAA CCGGGAATTA TTCTATTCAA
AAAAGTATCA AAGATTTGTC TTCCTCTAAC TTTACTAAAT TTGATAAAAT TATCGGGGTA
GTTCTGCCCG AACTTCAAGG AATTACGCAA ACAGAAGTAG AAGACTGGGC CCGTAGGGAA
GAAGTAAAGA ATTATTGGGA TGAGGAAAAT ATTCAAGATT TAATTAATAT AATTGAGGAT
ATGTTTGAAA AATGGGAGAA ACAGCAAGCA TCTGATACTA TGCCTATGTC TAATTTAGCT
ATAAAATTGA CTGATATTTT GATGGGAAAA ATACCTATCA AGGAGGATAC TGCATGA
 
Protein sequence
MDEIESIEKQ INNTKKNLLL TEERISEYPE TTSVPLKDIK NKRKLETRLK KLRAQLIGLD 
KHRDRLKERR DIPPLLPYFV NRSSQQIELK EALKKLFNHP YSQPLLCIIH GDKLQCHDTF
FQFLEKEFLP KIFKENTNIN TNNNYTEFIK EYQLKWPPRN TSIEDLKNRL EHNLVDTVLD
NKFASKADIN EYLASIGTVI IHTHLLTADL QENSSIIIEK FIEFWQDWPE LIPNQHLIIF
LFIKYKIEPK QNYFQKLQNW IFQNLFKNNS NTRTGNYSIQ KSIKDLSSSN FTKFDKIIGV
VLPELQGITQ TEVEDWARRE EVKNYWDEEN IQDLINIIED MFEKWEKQQA SDTMPMSNLA
IKLTDILMGK IPIKEDTA