Gene Tery_2953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2953 
Symbol 
ID4245295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4588813 
End bp4590948 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content31% 
IMG OID638107991 
ProductTPR repeat-containing protein 
Protein accessionYP_722588 
Protein GI113476527 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0781794 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTTT CTAAGACAGC CATCTTAACC CTTACTGCCC TCAGTTCCCA AAATTTAGCT 
ACAGAGGCAA AAGTTACTCC TAATACTAAT CCATATTTTC AGACAACTGC ACCAGAGAAT
ATTAATCTTA CTAACTCTAT TAATGTTATT ACTGTGCCTT CCACAAAATC AATTAGCCCC
CCAGAAATAT TAGTCAGCCA AGCCTATAAT AACCTTAACC TTAAGACTAC TTTATCAACT
CCAAATATAC ATCAAATTAA AAGAACTTTA AACAATAGTT GGTTAGGGAA AATAGTAAAG
TTTTCAGCAA AAAATAATTA TCAATTCCAG CAATTTGCTG AACAAATGGC TTTAAAAACA
GATTTAGAAT CAGAAGCTAG TCTCAATCTT AATCCTACCC TAACTTTATT GAATACTTTA
TTATTAGTTG CAACTTTATT ACCTCCTATA ACTATAGGTT TTTTTTGGCT CATCAGAAGA
TTGGTAATTA AAGAATTAGT AGGTGAAGTT AATAAGAGGT TAACAAAAAT AAGTCAATTA
GAATTAAGTC TGAATCAATC TCAAAAATTA TTTACTGAAT TAGAGAGTCA TATAGTGACT
GCAAAACAAT CAATTGATTT TCTGCATCAA GAAGCTAAGA TATCAAAATC ATCTGTAGAA
CAAATAGAAG TTCTGAAGTC TCAGTTTTTA ATGCAACTGC AAGTGATTAT TTCTGAGGCT
CAAGAAGCCA AACATCAAGC TATTCAAGAA ATTAATTATT CTGTCAATTC AGAAACTAAA
TCTGTTAAAA ATACCCAGAA ATTAAAATCC TCTGAAAAGC AGCCTACTAT GATTGCTGAT
GATTATTTTA AACTAGGAGA AAAACAATTT TATGATGGTC AATATAATCA AGCTTTAGCA
AATTTTGAAA AAGCTATTTC TCTTAATTCT TATTTAAGTG AAGCTTGGTT TAAGTCTGGT
AATGTATTTG TAAAATTGCA CCGTTATTCT GACGCTCTTG CTGCTTACGA TCATGCTATT
GCAATTCATT CTGATAGATT TGAATATTGG TTTAATCGTG GCAATGTATT TGTCAAGTTA
GAACGTTATT CTGAGGCATT AGCTTCCTAT GATAAAGCTC TTTCTCTTAA TCAAAATCAT
GTAGAAATTT GGCTAAACCG AGGTATTTTA TTCAGGAAAT TACAACGGTA TAATGAAGCA
GTTGTTTCAT ATCAAAAAGC TATTTTAATC CAGCCTAAAA ATGTTGATAT TTTGCATAAT
TTGGGTGCTT TATTAGGAAA GTTAGAGCGC TATGAAGAAG CAATCACTAC TTTTGACCAA
GCTCTGAAAA TTCAACCAAA TAAATTTGAA ATTTGGTATA ACCGAGGCAA TTTACTGGGA
AGGATACAAT CTTTTAACGA GGCAATTAAT TCTTATGACA AAGCACTAAA AATTAAACCA
GATAGGTATG AAATTTGGTA CAACAAAGGT GCTATTTTAT GGCAAATAGA AAAATATCAA
GAAGCAGTTA ATTGTTACGA CCAAGCAATT AATTTAATGC CAGATGATTA TGAAGTTTGG
CATAATAGAG GAGTAGCTTT AGGTGCTTTG GAAAAGTATC AGAAAGCAGT TAATTCTTAT
GATAAAGCAA TTAAAATTTA CCCCCAATGT TATCAAGCTT TTATCGGTAA AGCAGAAACA
TTATTGAAGT TAGAACAATA TGAGGAAGCT TTAAGTTCTT GTAATCATGC TATTGCTATT
AAAAAGGAAC GTTATGAAGG TTGGTTATGT CAAGGTCGGG TATTAGAAAA ATTGACAAGT
TATGAAGAAG CTTTGATGGC TTATGATCAA ACAATTGCTA TTAATGATAA TAGTTATGAA
GCTTGGGCAA GAAAAGGTAT GGTCTTAGAA AAAATTAAAC GCTATCAAGA AGCTTTAATG
TGTTATGACC GGGCTATTGC GATTAAACCG AATGATTCAG AATCTCAGCT AAAACGTCGT
AAATTATTGT CAGAATTACA AACAGAAGGA ATTTTCACTG AGGCAAATAA ATATAATTTA
CAACTCAATT TAGATGTTAG TAATAACCAA GATAAAAAAC TCAAAAAAAA TCAAAAAAAT
TATCCGTATC TTGCACAGAA GGGAAGAAAG AGCTAA
 
Protein sequence
MNLSKTAILT LTALSSQNLA TEAKVTPNTN PYFQTTAPEN INLTNSINVI TVPSTKSISP 
PEILVSQAYN NLNLKTTLST PNIHQIKRTL NNSWLGKIVK FSAKNNYQFQ QFAEQMALKT
DLESEASLNL NPTLTLLNTL LLVATLLPPI TIGFFWLIRR LVIKELVGEV NKRLTKISQL
ELSLNQSQKL FTELESHIVT AKQSIDFLHQ EAKISKSSVE QIEVLKSQFL MQLQVIISEA
QEAKHQAIQE INYSVNSETK SVKNTQKLKS SEKQPTMIAD DYFKLGEKQF YDGQYNQALA
NFEKAISLNS YLSEAWFKSG NVFVKLHRYS DALAAYDHAI AIHSDRFEYW FNRGNVFVKL
ERYSEALASY DKALSLNQNH VEIWLNRGIL FRKLQRYNEA VVSYQKAILI QPKNVDILHN
LGALLGKLER YEEAITTFDQ ALKIQPNKFE IWYNRGNLLG RIQSFNEAIN SYDKALKIKP
DRYEIWYNKG AILWQIEKYQ EAVNCYDQAI NLMPDDYEVW HNRGVALGAL EKYQKAVNSY
DKAIKIYPQC YQAFIGKAET LLKLEQYEEA LSSCNHAIAI KKERYEGWLC QGRVLEKLTS
YEEALMAYDQ TIAINDNSYE AWARKGMVLE KIKRYQEALM CYDRAIAIKP NDSESQLKRR
KLLSELQTEG IFTEANKYNL QLNLDVSNNQ DKKLKKNQKN YPYLAQKGRK S