Gene Tery_4626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4626 
Symbol 
ID4246280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7112497 
End bp7113855 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content33% 
IMG OID638109495 
ProductTPR repeat-containing protein 
Protein accessionYP_724071 
Protein GI113478010 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA TTAAAGGGAA CAGGGTTAAG GAACTCAGAA ATTTTTTTTG TAATCAGAGT 
TTGATAATTT CTCGAAAAAA AATTGGCTGG AAAATTTGGG GTTTTAGACC TAATGATAAT
TATTTTGAGA AGGGTTATAA GGGTTTTACT CTGAGTTTGC TTAGGTTAGT TTCTACTCTA
ATTTTAACTG TTATAGTTTC TATGAGTGAC AGTATTGCTT ATGGTCAAAA TCAAAGTTTT
GATAATTATC AAAAAAGAAA ATCTTTGATT TTTTTGAGTC AAAACCAAGA ACAAGTTGAA
GATTTTAAAC TAAGAGAAGA AATTCGCGAT GGTGTCCGAG AGGAGGTAGA TTATACTTTT
CGTCATGCTA TGTCATTACT TAGTGTTTTT CTAATTGTTT TGACTTTTTT CCCGGCTTCA
GCGGCAGTTT GGATTTGGTT TCTTCAGGCA AAGTTAGCTC ACAAAGTTGA TGTTACAAAA
CAAGAGATTG ATAGTTTTAA ATATGATACA GTATCTCAAC TTAAACAAAT TATTGTTGAT
ACTCAGGTTA TTTTAGATGA ATTGAGGGTA GAAAGTGGCA AGGCTGAGGA AAAAATTGAA
CAACTTCAAC AGGATACTTT GATTCAATTT TCTTCTGAAC AAGGTGATAA TTCTGAGCCT
TTAATGATGG CGAAAGACTA TGCAAAAAAA GCTGATACTT TTTTCTTTTC GGGTCAGTTT
AAGGAGGCTA TTAATGCTTA TAATCAAGCT TTAAAAATTC ATCCAAAAAT GGCGGATGTT
TGGAATAATC GGGGTGTGGC TTTGACAAGA TTAAAGATAT TTGATGAGGC AATTTCTTCT
TATGATCGAG CTTTACAAAT TCGGGCTGAT TATGCGGATG CTTGGAATAA TAGGGGTGTT
TGTTTGATAG AATTGCAGCA TTATCAAGAG GCAATTAATT CTTTTGAGCA AGGAATTAAG
GTTAAACCTG ATTATGCAGA TGCTTGGAAT AATAGGGGTG TTTGTTTGGC AAAAATTCAA
AAATATCAGG AGGCAGTTAA GTCTTATAAT CAGGCAATTG CTATTAAAAA TGATTATGGT
GATGCTTGGA ATAATCGCGG TGCTTGTTTG ATGAAGTTGG GAATTTATGG AGAGGCGATC
GCTTGTTTTG ATAATGCTGT AAAGATTCAA CCTGACTTTT TTAGTGCTTG GTATAATCAG
GCTCGTTGTT ATAGTTTAAA GGGTGATGTT GATATGGCTT TAAAAAGTTT TGAAAAGGCT
GTTAGTTTGA ATGGTAAAAA GTCTCAAAAG ATGGCAAAAA ATGAACCTGA TTTTGATAAT
ATTCGGGACC ATGAATTGTT TCAGAAGTTG ATAGTTTGA
 
Protein sequence
MKIIKGNRVK ELRNFFCNQS LIISRKKIGW KIWGFRPNDN YFEKGYKGFT LSLLRLVSTL 
ILTVIVSMSD SIAYGQNQSF DNYQKRKSLI FLSQNQEQVE DFKLREEIRD GVREEVDYTF
RHAMSLLSVF LIVLTFFPAS AAVWIWFLQA KLAHKVDVTK QEIDSFKYDT VSQLKQIIVD
TQVILDELRV ESGKAEEKIE QLQQDTLIQF SSEQGDNSEP LMMAKDYAKK ADTFFFSGQF
KEAINAYNQA LKIHPKMADV WNNRGVALTR LKIFDEAISS YDRALQIRAD YADAWNNRGV
CLIELQHYQE AINSFEQGIK VKPDYADAWN NRGVCLAKIQ KYQEAVKSYN QAIAIKNDYG
DAWNNRGACL MKLGIYGEAI ACFDNAVKIQ PDFFSAWYNQ ARCYSLKGDV DMALKSFEKA
VSLNGKKSQK MAKNEPDFDN IRDHELFQKL IV