Gene Tery_0790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0790 
Symbol 
ID4243201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1256445 
End bp1257632 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content46% 
IMG OID638106071 
ProductNHL repeat-containing protein 
Protein accessionYP_720683 
Protein GI113474622 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.395295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAT CTCTTAACAA TTACCCTAAT GAAAACCAAG AAATAAAACC ATATATCCTT 
GAACCTAAGG GAGCAGAAAT TATTCTTGGC GCAGCATATT CTAATTCCTT AGTTGTACCA
CTAACACCCA GTAAAACCAC GATGTTCGGA CCCCGTGGGG CTTGCCTCAT CTCCGAAAAC
GGCCCCCTGT GGGTAGCTGA TACAGGACAT CATCGGTTAT TAGGATGGCG ACAACGCCCC
GAAACTGACG ATCAACCTGC TGACTGGGTG ATAGGGCAAC CTGATTTTTC TCAAGAGGGG
CAAAACGCTA ATGGTCAAAC AACCGCAGCA ACTGTTAGCG TTCCTACTAG TATCTGTGCT
TATGGGGAAG GGTTGGCAGT GGCGGATGCT TGGAATCATA GGGTCTTGAT TTGGAAACAG
TTACCGGAAG ATAATAATGT TCCTGCTGAT ATTGTTTTGG GGCAAGCAGA TTTTTCTGAA
AATGAATCTA ACCGAAGCAA GTTAGAAACT GCTGCTGACA GAATGCACTG GCCTTATGGT
GTCATCTGTC ATCACGATCA GCTCTGGGTA GCGGACACGG GTAACCGTAG GGTATTGATG
TGGCAGCAAT TACCAGAGGT TAACGGGCAA CCAGCAGATT TGGTTTTGGG ACAAACAGAT
ATGAGTTGTC GCGATGAAAA TGGTGGCGGA GAGGCAACGG CAGCAAGTAT GCGTTGGCCT
CATGATATTA CTTTTTGGGA AGAAAGTTTG GTTGTGACGG ATGCTGGCAA TAACCGGGTG
ATGGTTTGGG ATGGTATTCC TACTGAGAAT AATCAACCTT GCTCTGTAGT GCTTGGTCAA
TCTCAATTTG ACACGGTACA GTTAAATCAG GGGGTTTATT TTCCTTCTGC TGTTAGTTTG
AGTATGCCTT ATGGTGTGGT GGCGACTGGG GAATGGTTGA TCGTTGCTGA TACTGCTAAT
TCCCGGTTGT TGGGTTGGCG GGATGTTGTG GGGATGGGAA CTCCTGCACT GGCTTTGACT
GGTCAACCTC ATTTTGAGAG TAAGAGTGAA AATGCTTTGA GTTTACATCC AACTCGGCAA
AGTTTGTGCT GGCCTTATGG GATTTCTGTT TGTGGTAATA CTGCTGTTAT TGCTGATTCT
GGTAATAATA GGGTTTTGTT GTGGTCTTTG ACATCCTCCC CAACTTAA
 
Protein sequence
MKVSLNNYPN ENQEIKPYIL EPKGAEIILG AAYSNSLVVP LTPSKTTMFG PRGACLISEN 
GPLWVADTGH HRLLGWRQRP ETDDQPADWV IGQPDFSQEG QNANGQTTAA TVSVPTSICA
YGEGLAVADA WNHRVLIWKQ LPEDNNVPAD IVLGQADFSE NESNRSKLET AADRMHWPYG
VICHHDQLWV ADTGNRRVLM WQQLPEVNGQ PADLVLGQTD MSCRDENGGG EATAASMRWP
HDITFWEESL VVTDAGNNRV MVWDGIPTEN NQPCSVVLGQ SQFDTVQLNQ GVYFPSAVSL
SMPYGVVATG EWLIVADTAN SRLLGWRDVV GMGTPALALT GQPHFESKSE NALSLHPTRQ
SLCWPYGISV CGNTAVIADS GNNRVLLWSL TSSPT