Gene Tery_1741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1741 
Symbol 
ID4245398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2650665 
End bp2652191 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content39% 
IMG OID638106865 
Producthypothetical protein 
Protein accessionYP_721474 
Protein GI113475413 
COG category[S] Function unknown 
COG ID[COG1649] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.230988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACAA AGTTTACCTC TTTGAAAAAA ATTAAATCTA GAAAATATTT TCATATAAAT 
AGTTTGGGAG CTATTTTATT TAGTCTGGCA GTAAATTTAT CCCCGTGGTT TTCAGCAAAA
GTACAAGCAC AAACAGACAT TTATTGTAAG TTACCGCCAG AAGCGATCGC CTCTAAAGAA
AATCTTCGCC AAGCAGTTTT AGAAGGAAAT AAAAATGCAG AAAAACAATA TCAAGATATC
CTAATTAAGC ATAATAGAGA AGTTGGCAAT TGTCGCATGA GAAATTGGCC GAGAACTCAA
GGTATTTGGT TGCGACTATA TCCTTGTGAT GCTAGACCAG GAGAAATTGA CAGGATACTA
GATAAAATTG TTAATCAAGG TTATAACCAA GTATATATAG AAGCTTTCTA TGACGGGCAA
GTGCTTCTAC CCGCAGCAAA TAACCCTACA GTTTGGCCTT CTATACTCCG TGTACCTGGT
TATGAAAATG TCGATCTATT AGCTGATAGT CTGAAAAAAG CAAAGGAAAG AGGTTTGCGT
GCTTATGCTT GGGTATTTAC CATGAATTTT GGGTACACTT ATTCTCAACT GCCTAACCGT
CAACAAGCTT TAGCGCGTAA TGGTAGAGGT CAAACAACCC TGGATGTGAT TCCAGATAAT
GTTAGTTTAC AGAACCAGTT AGGTGCGAGT CATGCTTTCC ATACTTTTAT AGATCCTTAC
AGTCCCCAAG CACGGCAAGA TTATAATGTT ATGGTGAATG AGGTTTTAAA ACGACAACCT
CAAGGAGTTT TATTCGACTA TATTCGTTAT TTGCGGGGAA TGGGGAGTGA CTCTGTAGCC
GACCAGGTAA AAGATTTATG GATATATAGT GAGGCTTCTC AGAATGTGTT ATTGCAACGG
GCTAAAAATG AAGCGGGAAA GGAATTAATT AGAAAATTTG TAGACAAGGG GTATGTTACT
TCTCAAGAAA TTAATGGGAG AACTCCCAAA TGGCAACGTT TCTTTTCACC CTCTATTAAT
AGCAGACTAA CGGAGCGAGG TTTGGAAACA CAAATTTGGG AATTGAGTGT TGCTCATGCT
GCTCAAGGAA TACTAGATTT TCTCCAGGTA GCTAGTCAAC CAGTGCAAGA AAAAGGTCTG
CCTGCTGGTG CTGTATTTTT CCCTGGTGGG AATAGAAGAA TACAGAGTAA TGGTTTTGAC
TCTCGCCTCC AACCTTGGGA TCAATTTCCG ACTTCGATGG AATGGCATCC AATGGCGTAT
GCAACTTGTG GCGATCTCGA TCCCAGTTGT ATTGTTTCTA AAGTGGAGAG AGTTATGAGT
ATGACTCCTA AGGGGGTGAA AGTTATTCCG GCGATCGCTG GGGCTTGGGG AGAACCTTTG
AAAAATCGTC CTTCTTTGGA AATACAAATG CAAGCTATTA AAGTCGCAAC TCCTCAGATT
AATTCTATTA GTCATTTTTC TTATGGTTGG CAAAATATTG AAGAAACCAG GGAACGTAAA
CATTGTCGGT TGTCAACTGG GAATTAA
 
Protein sequence
MQTKFTSLKK IKSRKYFHIN SLGAILFSLA VNLSPWFSAK VQAQTDIYCK LPPEAIASKE 
NLRQAVLEGN KNAEKQYQDI LIKHNREVGN CRMRNWPRTQ GIWLRLYPCD ARPGEIDRIL
DKIVNQGYNQ VYIEAFYDGQ VLLPAANNPT VWPSILRVPG YENVDLLADS LKKAKERGLR
AYAWVFTMNF GYTYSQLPNR QQALARNGRG QTTLDVIPDN VSLQNQLGAS HAFHTFIDPY
SPQARQDYNV MVNEVLKRQP QGVLFDYIRY LRGMGSDSVA DQVKDLWIYS EASQNVLLQR
AKNEAGKELI RKFVDKGYVT SQEINGRTPK WQRFFSPSIN SRLTERGLET QIWELSVAHA
AQGILDFLQV ASQPVQEKGL PAGAVFFPGG NRRIQSNGFD SRLQPWDQFP TSMEWHPMAY
ATCGDLDPSC IVSKVERVMS MTPKGVKVIP AIAGAWGEPL KNRPSLEIQM QAIKVATPQI
NSISHFSYGW QNIEETRERK HCRLSTGN