Gene Tery_3973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3973 
Symbol 
ID4244539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6143635 
End bp6144861 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content35% 
IMG OID638108889 
Producthypothetical protein 
Protein accessionYP_723471 
Protein GI113477410 
COG category[S] Function unknown 
COG ID[COG4399] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCTAATA TCTGGCTTTA TTTTGTACCA CCAATTGCAG GTGGAATTAT TGGCTATTTC 
ACTAATGATA TAGCCATAAA AATGTTATTT CGTCCTTATC GTCCTTATTA TATTTTTAGG
CGAAAGCTAC CCTTTACTCC AGGATTAATT CCTGCTAACC AAGAACGTTT AGCAAAACGG
GTTGCTGATA CCATCATGGG ATCTTTGTTA ACTCCATCAG AGTTACAAAA CTTAGCACGT
CGTTTATTAC AAACTGAACG GATGGAAGCA GCTATTCTGT GGCTGCTGCA AATGTCTTTA
GACCAATTAA AGCTCAATAC AGATACCAAG AGTACCAAAA TTTTGGCAAA TATTCTACGG
GATTTATTGG GTCAATCTTT GCCACGACTT CTTAAAGTCT GGGCGAAGAG AGAATATTTT
TTAGAAGCTC AAATAAACCA AATTTTTGAC CAGATTTTGC TAGAGTTTCA ACTAACTGAA
ATACAAGCAG CCCAACTATC AGATTGGCTG CTTAAAGTAG TTGTGCCACC AGATGTATTA
AGAAAGACTT TAATTGATTT TCTCACAGAT CAAAATATTT CCATTATTGA TGAAGGTTTT
CGAGAAAAAG CTAGTGGTAC TTATTGGGTA GTAGCTAATT TGTTTGGGCT GCGTAATACT
TTAACTAGGT TAAGGACTTT TTGTTTAGAT GAGAGAGATT TAACTAATCA ACGCTTAATG
GAATTAATTA CTGCATTGGC TGTTAAAGAA AGAATTACTG AATGGTTACA CAGTCTTTCA
ATGCAAAATT TACCTGTTTC TACAGTACGC GAGTTAAGAA ACACAATGCA GAATAGTGTA
CGTTTATATC TACAAGAAAA TGGTACTGAT TTGATTCAAG CTTTAAGTTT ATCTGTTGCA
TGGGAACATA TTGCAGATTT AATCATTAAT CGTTTACAGG CTTCATCAAT AATGAATAGT
TCTCTGGAGT TAGTAAGTCG AGAACTAGCG TTAATTTTAG AACGTTACTT GGAGCGAGAT
TTAGAAAATA TAGTCGCTCT GGCAATTCCT ATTTTAAATA TAGATCAAGT GATTATTGAT
CGTATCAAGG GTACTTCTGC TGAAGAATTG GAAGTTGCTG TTAATGTAAT TGTTAAAAAT
GAGTTGCAAG CTATTGTTAA TTTGGGTGGA GTTTTAGGTG TTGTTGTTGG CTCGTTTCAG
ACGATTTTGT TAGTACTACA AAGGTGA
 
Protein sequence
MSNIWLYFVP PIAGGIIGYF TNDIAIKMLF RPYRPYYIFR RKLPFTPGLI PANQERLAKR 
VADTIMGSLL TPSELQNLAR RLLQTERMEA AILWLLQMSL DQLKLNTDTK STKILANILR
DLLGQSLPRL LKVWAKREYF LEAQINQIFD QILLEFQLTE IQAAQLSDWL LKVVVPPDVL
RKTLIDFLTD QNISIIDEGF REKASGTYWV VANLFGLRNT LTRLRTFCLD ERDLTNQRLM
ELITALAVKE RITEWLHSLS MQNLPVSTVR ELRNTMQNSV RLYLQENGTD LIQALSLSVA
WEHIADLIIN RLQASSIMNS SLELVSRELA LILERYLERD LENIVALAIP ILNIDQVIID
RIKGTSAEEL EVAVNVIVKN ELQAIVNLGG VLGVVVGSFQ TILLVLQR