Gene Tery_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1747 
Symbol 
ID4245404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2659750 
End bp2660862 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content42% 
IMG OID638106871 
Producttwitching motility protein 
Protein accessionYP_721480 
Protein GI113475419 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.555642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTGA TGATTGAGGA TGTTTTAGAA TCTTTAGTTG AACAAGGTGG TTCTGATGTT 
CATATTCAAG CAAGAGCACC AATTTTTTTC CGTATTAACG GTCAGTTGAC TGCTCAAACT
CAGTTTGGGG AAAATATGGA ACCAATGGTA GTACAAGCTC TGATTTTCCA AATGCTCAAT
AATATGCAAC GGAAACATTT AGAACAAAAC TGGGAGCTTG ATAGCGCTTA TGGTGTTAAG
GGTTTGGCTC GTTTCCGTCT AAATGTGTAT CGGGAGCGGG GTAGTTGGGC TGCTTGTATG
CGTGCTTTGG CTTCTAAAAT TCCTAATGCA GATAAATTAG GGATTCCCAT GATTTTACGG
GAAATGACGG AGCGACCCAG GGGATTGTTT TTGGTGACGG GGCAAACAGG TTCTGGGAAG
ACAACTACGA TGGCAGCTTT AATAGATTTG ATTAACCGTA CTCGTACTGA ACATATTTTG
ACTGTGGAAG ACCCCATAGA ATATGTGTTT CCCAATCAAA AAAGTTTGTT TCATCAACGA
CAAAAAGGTG AAGATACGAA AAGTTTTGCT AATGCTCTGA AGGGAGCGTT ACGTCAAGAC
CCTGATATTA TTCTGGTTGG GGAAATGCGG GATTTGGAAA CTATTGGTTT GGCTTGTAGT
GCGGCAGAAA CAGGTCACTT GGTTTTTGGG ACTCTCCACA CTAACTCAGC TGCGGGTACG
GTAGACCGGA TGTTGGATGT TTTCCCGCCG GAACAACAAC CACAAATGAG GTCGCAGTTG
GCTAACTCTA TTGTTTGTAT TTGTAGTCAA AACTTAGTTA AGAAAACAGG TGGGGGTCGT
TGTGCGGCTC ATGAGATTAT GTTGAATACT CCGGCGATCG CTAACTTAAT TCGGGAGTCG
AAAAATTCTC AGCTTTATTC TCAAATTCAA ATGGGCGCTA AGTTAGGAAT GCAAACTATG
GAAATGTCTT TGGCTAAACT TTATGAAAAA GGGAATGTGA CTTGGGCAAA TGCTATGGCT
AAGGCAGTTA AGCCTGATGA ATTAGAGGCT TTGATTGGCC CTGAGCCCAG GGAAACTAAG
GCTAAAACTA AGGCTAAAGC TAGAGCTCAT TAA
 
Protein sequence
MSLMIEDVLE SLVEQGGSDV HIQARAPIFF RINGQLTAQT QFGENMEPMV VQALIFQMLN 
NMQRKHLEQN WELDSAYGVK GLARFRLNVY RERGSWAACM RALASKIPNA DKLGIPMILR
EMTERPRGLF LVTGQTGSGK TTTMAALIDL INRTRTEHIL TVEDPIEYVF PNQKSLFHQR
QKGEDTKSFA NALKGALRQD PDIILVGEMR DLETIGLACS AAETGHLVFG TLHTNSAAGT
VDRMLDVFPP EQQPQMRSQL ANSIVCICSQ NLVKKTGGGR CAAHEIMLNT PAIANLIRES
KNSQLYSQIQ MGAKLGMQTM EMSLAKLYEK GNVTWANAMA KAVKPDELEA LIGPEPRETK
AKTKAKARAH