Gene Tery_1476 details

Gene Information

Locus tag: Tery_1476
Symbol:
ID: 4241682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2239624
End bp: 2241048
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 36%
IMG OID: 638106628
Product: hypothetical protein
Protein accession: YP_721238
Protein GI: 113475177
COG category: [S] Function unknown
COG ID: [COG0397] Uncharacterized conserved protein
TIGRFAM ID:
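
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (the record does not state its convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2239624
end_bp = 2241048

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1425 bp) and Protein Length (474 aa).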


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.276999
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.151455
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAATC CTTTACTTTC TCTCAACTAT GAACCTGCCA TTCAAGCTTT AGGCGGTGAC 
TATTATGATG AGGTACTTTC TGCTGAATTT CCACAACATA TTTTGCGATT TCGTAATGAT
CAATTACTCC CTAAAATAGG ACTAAATTCT CAAGATGTTA AGGATGAGCA TTTTATTGAA
GCTTTTGGTA AGTTTCATTG TGTTAGGCCT TTTTTAGCTC TACGTTATCA CGGTTATCAA
TTTGGTGAAT ATAACCCTTA CTTGGGAGAT GGTAGGGGTT TTCTTTATGG TCAAGTGCGT
GGGGTAGACG ATGAATTATA TGATTTTGGT ACTAAGGGTT CGGGTAGAAC CCCTTATTCT
CGCAGTGCTG ATGGTAGACT TACTCTCAAA GGAGGGGTAC GCGAGGTTTT AGCTGCTGAA
ATTTTGCACC GTCATGGGGT TCGTACTTCT CGATGTCTAA GTTTGATTGA AACTGGGGAA
GGGTTATGGC GTGGGGATGA ACCTTCTCCT ACTCGCTCGT CGGTGATGGT GCGTTTTAGT
CGTTCTCATA TTCGCTTTGG AACTTTTGAA AGACTTCATT TTTATAAGCG CCCAGATTTA
ACGAAAAAAC TATTAAACCA TGTAATTAAT TGTTATTATT CTAATCTGAA AAAAGAGAAT
ATTTCCCAAA AGGATCCGTT TCAAGATTGC TATTTTTTAT TCTACTTAGA ATTAGTAAAA
AGAATTGCAA AATTAGTTGC TCAATGGATG GCTGCTGGAT TTTGTCATGG TGTATTAAAT
ACAGATAATA TGTCAATTAC TGGAGAAAGT TTTGATTATG GTCCATACTC TTTTATTCCC
ACATATAATC CTAAATTTAC AGCAGCTTAT TTTGATTATT CTGGTCTTTA TCGTTATAGT
CATCAACCAT TAGTTTGTAA GTCAAATTTA CAACTACTTC AAGAAGCATT AGCTGCAGTT
ATTGACCGGA AGAATATGAG GTCAGCCTTA GAAAAATTTG ATGATTTTTA TCTACATGAA
TATCGACAAT TAATGATGAG GAGACTAGGG TTTAAAAAGT TAGCTGAAGC CGATGCAGAT
AAGTTACTTC AGCTAACCAT AAAAATGCTC ACAGACTCTC AGGTTGGATA CCACGATTTC
TTTTTGGAAT TAAGACAAAA ATTTTCTCCC GAATGGCGTG ATGATATTAG TCAGATTTTT
GCTGATTTTG AACAGCCAGA ATTAATTGAT CCGTGGCGAC AATATTATTA TCATCTTTTG
CAGACTTATT CTGATAATGA ATTAGAGGAA ATGACGGAAA GGTTACAACA ATATAATCCA
CAACAAAGTT TAATTAGACC TGTCATTGAG TCAGTCTGGG AAGCAATTAC ACTAGAGGAT
AATTGGCAGC CATTTTATGA TTTATTACAG CAAATATATG ATTGA
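
The GC content field in the Gene Information section is the percentage of G and C bases in a nucleotide sequence. As a minimal sketch of how such a value could be recomputed from the gene sequence above, the hypothetical helper `gc_content` below strips the display spacing and counts bases. It is an illustration, not the method the source database uses, and it is demonstrated only on the first two 10-base blocks rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used for display in the record.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as a small demonstration.
print(gc_content("ATGTCTAATC CTTTACTTTC"))
```

Passing the complete gene sequence as a single string would give the figure to compare with the reported GC content.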
 
Protein sequence
MSNPLLSLNY EPAIQALGGD YYDEVLSAEF PQHILRFRND QLLPKIGLNS QDVKDEHFIE 
AFGKFHCVRP FLALRYHGYQ FGEYNPYLGD GRGFLYGQVR GVDDELYDFG TKGSGRTPYS
RSADGRLTLK GGVREVLAAE ILHRHGVRTS RCLSLIETGE GLWRGDEPSP TRSSVMVRFS
RSHIRFGTFE RLHFYKRPDL TKKLLNHVIN CYYSNLKKEN ISQKDPFQDC YFLFYLELVK
RIAKLVAQWM AAGFCHGVLN TDNMSITGES FDYGPYSFIP TYNPKFTAAY FDYSGLYRYS
HQPLVCKSNL QLLQEALAAV IDRKNMRSAL EKFDDFYLHE YRQLMMRRLG FKKLAEADAD
KLLQLTIKML TDSQVGYHDF FLELRQKFSP EWRDDISQIF ADFEQPELID PWRQYYYHLL
QTYSDNELEE MTERLQQYNP QQSLIRPVIE SVWEAITLED NWQPFYDLLQ QIYD
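
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below shows one way to reproduce that translation in plain Python. It assumes the amino acid assignments of NCBI translation table 11, which match those of the standard genetic code (the two tables differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acid string
# conventionally used for the standard code (assumed shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display in the record.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        # Codons with unexpected characters are reported as "X".
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCTAATC CTTTACTTTC TCTCAACTAT"))
```

The first 30 bases translate to MSNPLLSLNY, matching the first block of the protein sequence. The last gene sequence line ends with the stop codon TGA, which is why the protein has one fewer residue than the gene has codons.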