Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1037 |
Symbol | |
ID | 5693872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1221777 |
End bp | 1223237 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641263634 |
Product | fimbrial assembly family protein |
Protein accession | YP_001528924 |
Protein GI | 158521054 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4972] Tfp pilus assembly protein, ATPase PilM |
TIGRFAM ID | [TIGR01709] general secretion pathway protein L |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.13729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGCA GGGTTCTGGG TATTGATATC GGCAACACCA CGGTCTGCGC GGTTCTGGTG GGCCCCGGCG AAAGCGGCCT TCGCATAGAG GCCCACGCCC ATGCGGCAAT GGGAAAGGAC GTCTCCCCCT GGCAGGTGCT GCCCGGTGTG CTGGACGCCC TGGGTCACCG GGCCGACTGG TCGGGTGCGG CCTGCGTGGC CACCTTCCCG GACCGGCAGG TCTCCTACAG GCGTATTCAT ATGCCGTTTG CCGACCCCCG GAAGATCGAA AAGGTGATTG CCTATGAACT GGAGCCGCTG CTGCCTTTTG CGCCGGCGGA CCTGTTTATC GATTTTGTGG TGCTTTTCCC GGCGGTTTCC CAGGAAAACG AGGCGGGCGT CGAGGTGCTG GCCGCGGCGG TACACAAGGA AAAGGTGAAG GAGTGGATTG ACGGGTTTGC CGGTGCCGGT CTGCAGCCGG AAATGATCGT GCCCAGAGGA TGGGCCGCCG CCGCGGTGCT GTCGGCAGAT ATTGAGGAAG CCCTGCTGGT GGAGACTACC GGTGGGTATG TGACAACGGC CGTGGTTGGC GGCGGGGATG CCCAATGGAT CCGGTTTTTT TCAGCCGGCC CGCCGCCGGA TCAGGAGGCG GGCCGGTCTG CGCTGCGCGA CGGCCTGCGC CAGACCCTGC TGGCCTATGA ATATGAAACA GGGGTTGCCG CGGGGCTGAA AAACGTCTTT GTGACCGGTG CCGATTCCCG GGCAGGGGAG GCGGTAAAAA CGGTTGCCGA CTGCCTGGGA ATTGATGCCC GGCCGTTTAA CATGGCGGAC GGCTCTTCCC TGGTGACGGG TGGCCGGCCC GACCATACCT GGGAGCCGGG CCTTCTGGAC ATGGCGCTCT GCCCGGCGTT GCTGAAGAAA AGCCGGAAGC CCTGTATTAA TTTTCGGAAA GGGGAGTTCT CCCTGGCGGC CAGGTGGTTG CAATACCGGT ACCTGTTCGG TGGCACCGCC GCGCTGGCGG CACTGGTGCT GGTACTCGGC CTGATGGGTT TTTTTTACGA CCTTTATTCT CTCAATCGGT CCATAGCGAC TGTGGATGCC CGGCGGCAGG CGATTTTCCG GGAGTGCTTT CCCGAATCGG CGGCAGCACC GTCTGCCGAG ATGATGCGCT ACGAGATCGA CAGGCTTCGT GAATCCAGTG GCCTGCCTGC CGGACTGGAC CGGACTGTTT ACCAGGTGGA TATATTGAAC GACATCAGCC GCCTGGTGCC CCAGGAGACG GATGTGGTGA TCACCCGCCT GGACGCGGGG CTCAATGATG TTTCCGTCAC CGGCAACACG GATGCTTTTC AGTCCGTGGA CACCATGAAA AACAGGCTTG AGGAGAGTGA TCTGTTCGGG ACCGTTGTCA TCAGTTCCGC CACCATGGAT AAAGCCGATA AACGGGTTCA CTTCAAGTTG CAGGTGGAGC TGAAGTCATG A
|
Protein sequence | MDGRVLGIDI GNTTVCAVLV GPGESGLRIE AHAHAAMGKD VSPWQVLPGV LDALGHRADW SGAACVATFP DRQVSYRRIH MPFADPRKIE KVIAYELEPL LPFAPADLFI DFVVLFPAVS QENEAGVEVL AAAVHKEKVK EWIDGFAGAG LQPEMIVPRG WAAAAVLSAD IEEALLVETT GGYVTTAVVG GGDAQWIRFF SAGPPPDQEA GRSALRDGLR QTLLAYEYET GVAAGLKNVF VTGADSRAGE AVKTVADCLG IDARPFNMAD GSSLVTGGRP DHTWEPGLLD MALCPALLKK SRKPCINFRK GEFSLAARWL QYRYLFGGTA ALAALVLVLG LMGFFYDLYS LNRSIATVDA RRQAIFRECF PESAAAPSAE MMRYEIDRLR ESSGLPAGLD RTVYQVDILN DISRLVPQET DVVITRLDAG LNDVSVTGNT DAFQSVDTMK NRLEESDLFG TVVISSATMD KADKRVHFKL QVELKS
|
| |