Gene Information |
Locus tag | Tery_2349 |
Symbol | |
ID | 4245231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3628413 |
End bp | 3630362 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638107442 |
Product | TPR repeat-containing protein |
Protein accession | YP_722042 |
Protein GI | 113475981 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0186841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATA TTACTAAACT AACTGTTGGG TTAATTACTA TTGCTCTGGT CAGCTGCAAT AAATATGAAT CACAAATAAG TACAGATGTT TCTACTAAAA TTTCTAGACC TGATCCAATT ATTACTCCTA GACAAGACAA TGCTCCTTTC AACCAAATTT ATATATATAG ATTAGCTGAA AATATTACTG TAGAAGTTGT TGCTCGTAGT ACTGTTTCTG GAGGTGACCC TTTTAGCTCG TCAGGTTCGG GGGTTATTAT TGGTAAACAA GGTTCTACTT ATTATGTGCT CACTGCCAAG CATATTTTTC AGTACCAAGA TGACTATCGG GTAGTTGTTC GCAGTAAAAA ACCGGGGGAG GATGCGGAAA TTCTGAAGCT AGAAATTATC TACCGTTATC CAGATGCAGA TTTGGCTGTG TTTAAGTTTG CTAGTGTGAA GCAATATAAG GTGGCTGAAG TAGGGGAAGC GAGTCAGTTA AGGAAAAATA GTGAAGTTTA TGTGGGGGGT TGGCCTGGGG CTGAGAATAG GGAAGGTTTT CAGTTCACTC CGGCGAAGGT GACTAACCCA CGGGCAGGAG ATCTTTTAAA TTATGAGCCT ACTGTACCTG GTGAGGGTGT ATATCCTGGT ATGAGTGGGG GGGCGGTTTT GAATAAGGCG GGGCAGTTGA TGGGTATTCA TGTGGGGTTG ACAAAGGCTG GAGATGGGGA GGGGGTTTTG GTTTCGACTT TTTTGCGGGA TATACCACAG CAGGTGAGCA GGGTTTTGGT GAGGTCAACT TCTGCTGCTT TGCCTTCTCC GATACCTTCC CCTAAAAATC GGGCTGTTGG AAATACTGAT AATGTTACTC CATCTCAAAT TCAGAATGCC GAAAGTTACT ATGAACAAGG AGATAAACAC CATGATAGGG AAGAATTTGA ACAGGCGCTC GCTGACTATA ACCAAGCCAT TCAACTTAAT CCCAAATATG CTGATGCTTA CAACAACAGG GGTATTGTTT ACCGTAAGCA GGGAAAATAT GATTTGGCGC TCGCTGACTT AAACCAAGCC ATTCAACTTA ATCCCAAATA TGCTGATGCT TACAAAAACA GGGGTAATGT TTACTATAAC CAGGGAAAAT ATGATTTGGC GCTCGCTGAC TATAACCAAG CCATTCAACT TAATCCCAAA TATGCTGAAG CTTACAACAA CAGGGGTTTG GTTTACGATG ACCAAGGAAA ATATGATTTG GCGATCGCTG AGTTTAACCA AGCCATTCAA CTTAATCCCA AATATGCTTA TGCTTACAAC AACAGGGGTG TGGTTTACGA TGACCAGGGA AAATATGATT TGGCGCTCGC TGACTATAAC CAAGCCATTC AACTTAATCC CAAATATGCT GAGGCTTACA ACAACAGGGG TGGTGTTTAC CTTGAGCAGG GAAAATATGA TTTGGCGATC GCTGACTATA ACCAAGCCAT TCAACTTAAT CCCAAATTGG CTGAAGCTTA CAACAACAGG GGTGCGGTTT ACCGTAAGCA GGGAAAATAT GATTTGGCGC TCGCTGACTA TAACGAATCC ATCAGACTAA ATAACCCTCA GCTTTGGCTG CCTTACAACA ACAGGGGTTT GGTTTACAAT GACCAGAGAA AATATGATTT GGCGCTCGCT GACTATAGCC AAGCCATTCA ACTTAATCCC AAAGATGCTT ATGCTTACTA CAACAGGGGT AATGTTTACG ATGACCAGGG AAAATATGAC TTGGCGATCG CTGACTATAG CCAAGCCATT CAACTTAATC CCAAATATGC TAATGCTTAC 
TACACCAGGG GTCTGACTAA CAAGGATCAA AGAAATATGG AAAAAGCTAT ATCAGACTTT GAAAAAGCTG CCGACTTATA CAAACAACAA GGAAATCAAA CATGGTATCA AAACTCTCTA GATCAACTCA AAGAATTACG AGGATCTTAA
|
Protein sequence | MKHITKLTVG LITIALVSCN KYESQISTDV STKISRPDPI ITPRQDNAPF NQIYIYRLAE NITVEVVARS TVSGGDPFSS SGSGVIIGKQ GSTYYVLTAK HIFQYQDDYR VVVRSKKPGE DAEILKLEII YRYPDADLAV FKFASVKQYK VAEVGEASQL RKNSEVYVGG WPGAENREGF QFTPAKVTNP RAGDLLNYEP TVPGEGVYPG MSGGAVLNKA GQLMGIHVGL TKAGDGEGVL VSTFLRDIPQ QVSRVLVRST SAALPSPIPS PKNRAVGNTD NVTPSQIQNA ESYYEQGDKH HDREEFEQAL ADYNQAIQLN PKYADAYNNR GIVYRKQGKY DLALADLNQA IQLNPKYADA YKNRGNVYYN QGKYDLALAD YNQAIQLNPK YAEAYNNRGL VYDDQGKYDL AIAEFNQAIQ LNPKYAYAYN NRGVVYDDQG KYDLALADYN QAIQLNPKYA EAYNNRGGVY LEQGKYDLAI ADYNQAIQLN PKLAEAYNNR GAVYRKQGKY DLALADYNES IRLNNPQLWL PYNNRGLVYN DQRKYDLALA DYSQAIQLNP KDAYAYYNRG NVYDDQGKYD LAIADYSQAI QLNPKYANAY YTRGLTNKDQ RNMEKAISDF EKAADLYKQQ GNQTWYQNSL DQLKELRGS
|
| |
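The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced: No"). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Tery_2349.
start_bp, end_bp = 3628413, 3630362

length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints "1950 649"
```

Both results agree with the Gene Length (1950 bp) and Protein Length (649 aa) fields listed above.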