Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1822 |
Symbol | |
ID | 4244536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2778826 |
End bp | 2781759 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638106944 |
Product | NB-ARC |
Protein accession | YP_721552 |
Protein GI | 113475491 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.244157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.101864 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGTT TCTTTTTGGT ATTTGACTGC AAAAATTATC CTCCTCAAGG GGATGTTGTT TTTGTTCATG GTTTGGCTGG TCATCCTTGG GGTACATGGC ATCCTCAAAA TAAAAAGGAT AAAGAAGATG TTGACTTTTT GCTGTGTTGG TTGGGTGAAG AACTACAAGC ACATGGAATA GATGTTAATG TCTGGAGTTT TGGATATGAT GCTCCTGGTT TTCAATATTT TGGTCAGGGA ATGCCGCGTT TTGATCTGGC TAGTAATTTA CTGGAGTATC TTCAGGTTTA TGATATTGGC AAGACATCCA CTAGTTTCCC CAAACGCCCA TTGATATTTG TAACTCACAG TTTGGGTGGT TTGGTGGTGA AGGAGGTAAT TCGTACAGCT CAAAGTTTTC CTGAGTATCT AGCTATTGTT AAACAGGTTA AAGGAATTGT ATTTTTGTCT ACTCCACACA CGGGAACTCA TTTGGCTAAT TTAATTGATC ACGTCGGTTT TTTGACTAGA CCTACAGTGA ATGTTGAAGA GTTAAAAGAA CATTCCCCTC AACTGCGAGA TCTTAATGAA TGGTATCGAC AAAATGTTGG CAAGTTGGAA ATTAAAACTA AGGTTTTTTA TGAAACAAAG TCTCTGAATG GGGTTTTGGT TGTGAATGAG GATAGCGCTA ACCCTGGTAT TCACGATGTT AAACCTATTG CTGTTTCAGC AGAAGACCAC AACTCTATTG CTAAACCTGG AAAAAATGAT TTGGTTTATC TTTCGGTGAG AAAGTTTTGT CAAGATATTT TTGCTTTAGA GGAAAGCACT TCTTCTCAAT ATCTTCACCA AAAATACTAT ATTCCTGGGG AGGCTAGCTT AGAGAATAAA GTTTTTGTAG GTAGGGAAAA AGAATTAGCT GATGTCGACA AGTTACTCAA AAATTATCAG CGGGTTTCTA TTGTTTCTGT TTCTGGAATG GGTGGTGTTG GTAAAACAGA ACTTTGCCGC CGATATGCTT ATGCTCATAA GTCTGCTTAC CCTGGAGGTA TTTGTTGGTT GGAAGCGCCG ACAGAAAATG CAGGTATTCA GATACTTAGG TTTGCTCAGA ATAATTTCCA GCTTGTTTTC TCTTCAGATC AAGATTTACC GGAAAGGTTG CGCTACTGTT GGCAAAAGTG GTCTGAGGGG AATACTTTGC TGATCTATAA TGATGTAACT GACTACAATA CTCAGGCCAA GCCTTTTTTA CCTCCGGACT TGTCTCGGTT TCGGGTGTTG CTGACCACTC GTAAAAGTTT TGGCTCGGCA TTTCCGGAGT TGCGTCTCGA TGTGTTAAAG CCTTTGGCGG GGATGAAGTT GTTAAGGTCT ATTTTGGGTA GAAAGAGACT TTTGCTAGAA CCTAGGAAGG CTAGAGAACT GCTGATTTTT TTGGGTTATT TACCTTTGGG AATTGAATTG GTGGGGCGAT ATTTGGATGA GTATTGGCAG AGTTTGAATA GGGATGGTTT GGCTTTGACA AAAATGCTTA AGCGTTTAGA AAGAAAGAGT CTGGAACATC AGGCTATGTC ATCTAATGAG TTAATTAATT ATCCTTATGG TGTGGCGGAG GCGATCGCTT TGAGTTGGGA GATGTTGGAT GAAAATACTC AGGAAATTGG GTTGAAGTTA GGTTTATATG CTTTGGCACC TATTCGTTTG TGGTGGGATG GGATAGAAGA TGATGAAGAG TTGGAAGGTT GGGAAATTGC TCTTGGGAAT TTGGAAAATC TGCATTTACT GAAGAGTGTT GAGCCTGGTG TTTATATTTT GCATTCTCTA GTGCGAGAGT TTTTGCAGAT GAAGTTGAAA GAATATCCAA GGGCAGATGA GTTAAAGCGG GGTATTTGTC AGGTAGTGGC AGAGGCGGCT AGGAATATTC CTGATAACAT TACAGTGGAG CAAGTTAAGG AAGTTGAGGT TGATATTCCT CATATTACGG AAGTGGCGGC AGTTTTGACT GAGTATTTGA GTGATGATGA TTTGATTTCT CCTTTTAATG GACTAGGTCG GTTCTATCTA GGTCAAGCAT TGTACTCACA GGCACAACTT TGGTTAGAGA CAGGTAAAGA AATAGCAGAA AAACGGCTAG ACAAAGATAA TGCTGACATT GGGAATATCT ACAACAGTCT GGCTTTATTA TATAAGTCTC AAGGAAAATA CGAAGCTGCA GAACCTTTGT ACCTACAAGC TATTGAAACC GCAAAAATAG CCCTCCCTGA AAATCATCCA TCTATTGCCA CAGGCCTCAA CAACCTGGCA AATTTATATT ATTCTCAAGG AAAATACGAA GCAGCAGAAC CATTGTACCT ACAAGCTCTT GAAATCAAAA AAATAGCCCT CCCTGAAAAT CATCCACAAC GTGCCAGCGG CCTCAACAAC CTGGCAGGTT TATATTATTC TCAAGGAAAA TACGAAGCAG CAGAACCATT GTACCTACAA GCTCTTGAAA TCGACAAAAT AGCCCTCCCT GAAAATCATC CACAGTTTGC CACTCACCTC AACAACCTGG CAAAATTATA TAGATCTCAA GGAAAATACG AAGCTGCAGA ACCATTGTAC CTACAAGCTC TTGAAATCGA CAAAATAGCC CTCCCTGAAA ATCATCCACA GTTTGCCACT CACCTCAACA ACCTGGCAAA ATTATATAGA TCTCAAGGAA AATACGAAGC TGCAGAACCA TTGTACCTAC AAGCTCTTGA AATCAACAAA ATAGCCCTCC CTGAAAATCA TCCAGATATT GCCACTGACC TCAACAACCT GGCTTTATTA TATGAGTCTC AGGGAAAATA CGAAGCTGCA GAACCTTTGT ACCTACAAGC ACTGAAGATA TTAAAACAAT CATTAGGAGA AGAACATCCT AATACTCAAA CAGTTCAGAA AAACTATCAA AATTTCTTAA ATGAGAAAAA ATGA
|
Protein sequence | MLGFFLVFDC KNYPPQGDVV FVHGLAGHPW GTWHPQNKKD KEDVDFLLCW LGEELQAHGI DVNVWSFGYD APGFQYFGQG MPRFDLASNL LEYLQVYDIG KTSTSFPKRP LIFVTHSLGG LVVKEVIRTA QSFPEYLAIV KQVKGIVFLS TPHTGTHLAN LIDHVGFLTR PTVNVEELKE HSPQLRDLNE WYRQNVGKLE IKTKVFYETK SLNGVLVVNE DSANPGIHDV KPIAVSAEDH NSIAKPGKND LVYLSVRKFC QDIFALEEST SSQYLHQKYY IPGEASLENK VFVGREKELA DVDKLLKNYQ RVSIVSVSGM GGVGKTELCR RYAYAHKSAY PGGICWLEAP TENAGIQILR FAQNNFQLVF SSDQDLPERL RYCWQKWSEG NTLLIYNDVT DYNTQAKPFL PPDLSRFRVL LTTRKSFGSA FPELRLDVLK PLAGMKLLRS ILGRKRLLLE PRKARELLIF LGYLPLGIEL VGRYLDEYWQ SLNRDGLALT KMLKRLERKS LEHQAMSSNE LINYPYGVAE AIALSWEMLD ENTQEIGLKL GLYALAPIRL WWDGIEDDEE LEGWEIALGN LENLHLLKSV EPGVYILHSL VREFLQMKLK EYPRADELKR GICQVVAEAA RNIPDNITVE QVKEVEVDIP HITEVAAVLT EYLSDDDLIS PFNGLGRFYL GQALYSQAQL WLETGKEIAE KRLDKDNADI GNIYNSLALL YKSQGKYEAA EPLYLQAIET AKIALPENHP SIATGLNNLA NLYYSQGKYE AAEPLYLQAL EIKKIALPEN HPQRASGLNN LAGLYYSQGK YEAAEPLYLQ ALEIDKIALP ENHPQFATHL NNLAKLYRSQ GKYEAAEPLY LQALEIDKIA LPENHPQFAT HLNNLAKLYR SQGKYEAAEP LYLQALEINK IALPENHPDI ATDLNNLALL YESQGKYEAA EPLYLQALKI LKQSLGEEHP NTQTVQKNYQ NFLNEKK
|
| |