Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1587 |
Symbol | |
ID | 4242736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2424735 |
End bp | 2427869 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638106729 |
Product | NB-ARC |
Protein accession | YP_721339 |
Protein GI | 113475278 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.257434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGATA ATTTTGAGGA TAAAGTTGAT ATTAAGGAGA CTCATGGTGT TGGTGTTAAT TCAGGCAATA TTGTAAATTC AACTTTTGCT AAGACAATTA TTAATGACAC AAAAAATGTT TCATGGAAGG GTGAGCCAGC AGAGTTTCCT AATAATTTAA ATATATTACG AACTGGTGCT GTTAAATTTG TTGGTCGAGA TAAAGATATC GAAAATTTAC ATGAACAACT TCAGGAAAAA GAGCGTGTAT CTATTACAGC AGTAGTTACA GGTATGGCAG GCGTTGGGAA AACAGAATTA GCACTTCAAT ATTCTCTTTT ATCTGAAAAG GAGTTGAATT ATCCTGGTGG TATTTGTTGG ATAAATGTTA GGGAAAGAAG TGTGGGAGAA CAGTTATTAA GCTTTGCTCA AACTCAATTA GGATTATTTC CGGCTGAAGA TTGGAGTTTA GAAGAAAGAA TTAGCTTTTG TTGGTCAAAT TGGCAACCAC CTGGAGATGT TTTAATTGTT CTAGATGATG TTAATAAATA TGAAGAAATT GAGCAGTATT TACCTCCACA AAAACAACGT TTTAAATTAT TAATTACTAC TCGTAAATAT TGGTTGTCAG AATCTTTTTC ACAGTTACGT TTGGAGGTTT TGGATGAAGA TTCTGCTTTA GAATTATTAG AGGTCTTAAT TGGCAATTCT CGTTTAGGGA CACAAATAGA GGAAGCAAAG CAACTTTGTG AATGGTTGGG ATATTTACCC TTGGGATTAG AGTTGGTGGG GCGATTTCTG AAGAGGCGGT CTGAATGGAC ATTGGAAAGA ATGATACAAG AGTTGGAAAA ACAAGCTTTA AATTTCTCGG TGCTACAGAA CCCACCACAG GGAGAAATGA CAGCACAGCG TGGAGTAGCC GCTGCTTTTG AATTGAGTTG GAATGAATTA GATGAAAGGG GAAGGTATTT GGGCTGTTTG TTGAGTCTTT TTGCATTGGC TCCTATACCT TGGAATTTGG TGGAAAAATG TTTGTCTAAA GATGAAAGTC AGGAGAAAGG AATTATACAA AGGTGGTTTC CTACTTTTTC ACGTTTATGG TTATTATTGA TGCCTCCAAA AAAAGTTGAT GTATTAGATT CAAGAACTTG GGAAGATATT AGGGAAGATA CTTTGTTAGA TTTAAATTTA ATTCAAAAGA CAACACAGGG AACTTATGAA TTGCATCAAT TAGTACGTCG ATATTTTCAA GATAAGTTGA ATGCAATGAA GGAAGTAGAA CAGTTGAAGT CTCAGTTTTG TCGGGTAATT GTAGGTGCGG CAGAGAAAAT TCCTTATAAC AATGACATTA CAGTAGAACA AGTTAAAGAA GTTGAGATTG ATATACCTCA TATTACAGAA ATAGCAGACA ATTTGGCTGA ATATTTGAGC GATGATGATT TGATTATACC TTTTACAAGC TTAGGCTCAT TTTATCAAGG TCAAGGATTG TACCCACTGG CACAACCTTG GTTAGAGAAA GGTAAAGAAA TAGCTGAAAA ACGTTTAGAT AAAAATAATT CTGATATTGC AGCTATTTAC AACAACCTGG CATCATTATA TCGTGCACAA GGAAAATACG AAGCAGCTGA ACAATTGTAC CTACAAGCAA TAGAAATCCA CAAAATTGCC CTCCCTGAAA ATCATCCAGG TATTGCCACA CACCTCAACA ACCTGGCAAA TTTATATCGT GTACAAGGAA AATACGAAGC AGCAGAACCT TTGTTCCTAC AAGTAATAGA AATCCACAAA ATCGCCCTCC CTGAAAATCA TCCAAATATA GCCAGCGGCC TCAACAACCT GGCAGCATTA TATAAGTTAC AAGGAAAATA CGAAGCTGCA GAACCTTTGT TCCTACAAGC AATAGAAATC GACAAAATCG CCCTCCCTGA AAATCATCCA TCTCTTGCCA CAGACCTCAA CAACCTGGCA TTATTATATC ATTCACAAGG AAAATACGAA GCTGCAGAAC CTTTGTTCCT ACAAGCAATA GAAATCGACA AAATCGCCCT CCCTGAAAAT CATCCAAATA TAGCCAGCGG CCTCAACAAC CTGGCAGCAT TATATAAGTT ACAAGGAAAA TACGAAGCTG CAGAACCTTT GTACCTACAA GCAATAGAAA TCGACAAAAT CGCCCTCCCT GAAAATCATC CACAACGTGC CACACACCTC AACAACCTGG CAAATTTATA TCGTGCACAA GGAAAATACG AAGCAGCAGA ACCTTTGTAC CTACAAGCAA TAGAAATCCA CAAAATCGCC CTCCCTGAAA ATCATCCAGG TATTGCCACA CACCTCAACA ACCTGGCAAA TTTATATCGT GTACAAGGAA AATACGAAGC AGCAGAACCT TTGTTCCTAC AAGTAATAGA AATCCACAAA ATCGCCCTCC CTGAAAATCA TCCAAATATA GCCAGCGGCC TCAACAACCT GGCAGCATTA TATAAGTTAC AAGGAAAATA CGAAGCTGCA GAACCTTTGT TCCTACAAGC AATAGAAATC GACAAAATCG CCCTCCCTGA AAATCATCCA TCTCTTGCAA GAGACCTCAA CAACCTGGCA GAATTATATC GTGAACAAGG AAAATACGAA GCTGCAGAAC CTTTGTTCCT ACAAGCAATA GAAATCGACA AAATCGCCCT CCCTGAAAAT CATCCATCTC TTGCCACAGA CCTCAACAAC CTGGCATTAT TATATCATTC ACAAGGAAAA TACGAAGCTG CAGAACCTTT GTTTCTACAA GCAATAGAAA TCGACAAAAT CGCCCTCCCA GAAAATCATC CACAATTAGC CACACACCTC AACAACCTGG CAGGATTATA TCATGCACAA GGAAAATACG AAGCTGCAGA ACAATTGTAT CTACAAACAA TAGAAATCGA CAAAATCGCC CTCCCTGAAA ATCATCCATC TCTTGCAAGA GACCTCAACA ACCTGGCAGA ATTATATCGT GAACAAGGAA AATACGAAGC AGCTGAACCT TTGTACCTAC AAGCTATTGA AATATTTACA CAATCATTAG GTGAAGAACA TCCCAACACT CAAACAGTTC TGAAAAACTA TCAAATATTT TTAAATGAGA AAAATGAATC AAAACAAAAT CAAGATAAAT ATTAG
|
Protein sequence | MSDNFEDKVD IKETHGVGVN SGNIVNSTFA KTIINDTKNV SWKGEPAEFP NNLNILRTGA VKFVGRDKDI ENLHEQLQEK ERVSITAVVT GMAGVGKTEL ALQYSLLSEK ELNYPGGICW INVRERSVGE QLLSFAQTQL GLFPAEDWSL EERISFCWSN WQPPGDVLIV LDDVNKYEEI EQYLPPQKQR FKLLITTRKY WLSESFSQLR LEVLDEDSAL ELLEVLIGNS RLGTQIEEAK QLCEWLGYLP LGLELVGRFL KRRSEWTLER MIQELEKQAL NFSVLQNPPQ GEMTAQRGVA AAFELSWNEL DERGRYLGCL LSLFALAPIP WNLVEKCLSK DESQEKGIIQ RWFPTFSRLW LLLMPPKKVD VLDSRTWEDI REDTLLDLNL IQKTTQGTYE LHQLVRRYFQ DKLNAMKEVE QLKSQFCRVI VGAAEKIPYN NDITVEQVKE VEIDIPHITE IADNLAEYLS DDDLIIPFTS LGSFYQGQGL YPLAQPWLEK GKEIAEKRLD KNNSDIAAIY NNLASLYRAQ GKYEAAEQLY LQAIEIHKIA LPENHPGIAT HLNNLANLYR VQGKYEAAEP LFLQVIEIHK IALPENHPNI ASGLNNLAAL YKLQGKYEAA EPLFLQAIEI DKIALPENHP SLATDLNNLA LLYHSQGKYE AAEPLFLQAI EIDKIALPEN HPNIASGLNN LAALYKLQGK YEAAEPLYLQ AIEIDKIALP ENHPQRATHL NNLANLYRAQ GKYEAAEPLY LQAIEIHKIA LPENHPGIAT HLNNLANLYR VQGKYEAAEP LFLQVIEIHK IALPENHPNI ASGLNNLAAL YKLQGKYEAA EPLFLQAIEI DKIALPENHP SLARDLNNLA ELYREQGKYE AAEPLFLQAI EIDKIALPEN HPSLATDLNN LALLYHSQGK YEAAEPLFLQ AIEIDKIALP ENHPQLATHL NNLAGLYHAQ GKYEAAEQLY LQTIEIDKIA LPENHPSLAR DLNNLAELYR EQGKYEAAEP LYLQAIEIFT QSLGEEHPNT QTVLKNYQIF LNEKNESKQN QDKY
|
| |