Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1273 |
Symbol | |
ID | 5694108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 1521805 |
End bp | 1524444 |
Gene Length | 2640 bp |
Protein Length | 879 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641263867 |
Product | TPR repeat-containing protein |
Protein accession | YP_001529156 |
Protein GI | 158521286 |
COG category | [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAACA GCACGTTCAA ACGCAGGCTT CTTCAGGCGA CGGCCCTGGT GGTCCTGGTG CTGGCCGTCT ACCAGCCATC CCTGCACAAT GGCTTTATCT GGGATGATGA TGCCTACGTC CATCAAAACC TTACGCTGAC AAGTCTTGAC GGGCTGAAAC GGATATGGCT GTCGCGGAGC GCCACGCCCC AGTACTACCC CATGGTCTTT TCCAGCTTCT GGGTCGAATA CCAAATATTT GGACCGGAGC CGATGGTGTT TCACCTTACC AACATGATGC TGCACGGCAT CAACGCCATC CTGGTCTGGC TGATCCTGAT GCGGCTGGGC CTGCCCTGGG CATGGCTGGC GGCCGCTGTT TTTGCTTTGC ACCCGGTCAA CGTGGAGTCC GTGGCATGGA TATCTGAACG CAAAAATGTG CTTTCCGGCC TTTTTGCGCT TTCCTCTCTT CTGCTTCTGG TCCGTCTTTA TCTGACCGAT ACCGATAAAA CCGTACAAAC GCCCCCTTCT CCGAAGAAGA ATGCGTATGC CCTCTATGGG GCTTCCTTTT TTCTTTTCAT CCTTGCCCTG CTGAGCAAAA GCGTCACCTG CATGATGCCG GTGGTGTTCC TGGTCCTGGT CTGGTGGAAA CGAGGCAAAA CCCCGCTTGG CACAATCGGC GCGACAGTTC CTTTTTTTAT CGCCGGCATC GTGGCGGGCA TCAACACCAG CCTGGTGGAA AAGCTCCATG TGGGGGCACA GGGTGCGGAT TGGGAGTTCA ACCTTCTTGA ACGGATGCTG ATTGCCGGCA GGGCCCTGTG GTTTTACGCA TACAAGTTAA TCTGGCCGTC GGAGATAATG TTCACCTACC CTCGATGGGA GATTGATTCC ACCGCCGGGT GGCAATACCT TTTTCCTGCT GCCGTTATTC TGCTTTTCGC GATTCTCTTT GCCGCAAAAA ACCGCGTCGG CAGAGGGCCG GTTGCCGGCG TGGCCATATT TGCAGTGACA CTGTTTCCGG CCCTTGGGTT TATCAGTTAT TTCCCCATGC TTTTTTCCTT TGTGGCGGAC CACTTTCAGT ACCTGGCCAC CATTGCTCTG ATTACGCTCG TCATTCAAGG ACTGCACCGC ATGACCAGCA CCGGGAGACG CCGGGCTCGA ACTCTCGCGA TGGGATTCTG TACTCTGACC CTTTTGGCGT TGGGGGTTCG CACCTGGCAG GAGCAGGACA AATACAAGAA CCTGCAAGCC CTGTGGGAGG ACACCATAAG AAAAAATCCC GACTGCTATC TGGCATTAAA CAACCTGGGG TGCGTGCTGA TGTCCCAGCC TGATAAGCTG GGCCAGGCCT ATGACATTTT TGCAGCCACT CTGAAGATGG GGCTAGATTA TCCCGAAACC CGTTTCAACC TGGCCCGCAC GCTTTTTTAT AAGGGCGACC ATGAAGCGGC AATACGCTAT TACACGGACC TGCTGGAGAA TAGCCCGGAA ATCTCACCCA AGCTGCTGGT GGATGTCCAT TACGACATGG CAATGATACT GGTCCATCGT GACCAGATCG AAGCGGCTGA AACTCATCTC CGGGCCGCTC TGGAGCTGAA ACCCTTTTTC CCGGAAGGAT ACAACGACCT GGGAGTGCTG CTGCGCCGGG CGGAACAGTT TGACAAGGCT ATTGACGCCT TTTCAGAAGC CCTTGCCATG AAGCCCGACT ATGCTGAAGC CCGGTACAAC CTGGCCCAGG TACTGTACGA CAGCGGCCAG GCCGAAAAAG CCATGGTCCA TTATAATTTC CTGGTCCACG ACGAGACAAC CGGCCCCGAC CTGCTGGCCA GCATTCATAA TGACCTGGGG GTTATTTATA CCCGCCGGCA CGAGCTGGAG GTTGCGAAAG ATCACCTTTC CCGTGCCCTG TGCCTGGCTC CCGACTCCCC CATGTTTCAC AACAACCTCG GCCTGCTGAT GATTAAGGCG GGCCGAAAGA GCACAGCACT GGAGTCCTTC TCAAGGGCCG TTGTCCTTGA TCCGGATTAC GCCGAGGCCC GTTACCAGCT AGCCCTGCTG TCGGCTGAGA CAGGTGATAC GGCAGCTGCC GTATCCCATT ACAATTATAT TCTGTCCAAC GCGCCGGCGG CAGATAACGT TAATCTTCTG GCCGATGTAT ACAACGGCAT GGGGAACATT TATGCTGAAC AAAACCAGAC AAAAAAGGCT ATTTACTATT TTAAACATGC CCTGGCTCTG AAACCCGATT TTGCTGAAGT CCATAACAAC ATGGCCCTGA CCCTGCTTTC TCTGAACAGA CGGCAGAAAG CCATTGATCA TTTTACCCGG GCACTTGCCA TTCAGCCCGG TTTTACGGAG GCGGCCAACA GCCTGGTGCT GACCTACAGT GCTGCCGGAG CGTATGACAA AGCCCTGACC GTGCTGAAGA ACCTGCTGGC CGCGGCGCCG GACAGCGCCG CCAGCATCAG CTACAACATT GCCTGCATCC ACTCCATCCG CGGAGACCTG GAAAATGCGG CCACATGGCT GAACAGGGCC ATTGACAGGG GGCTCCGTCT GCAACGCCTG CTGGAGACCG ACCCCGACCT GGAAAACATC AGAAAAAGCC AATACTACCC CGGCCTGCTG GAACGCATGC AAAAATCAAC CGAAAAATAA
|
Protein sequence | MANSTFKRRL LQATALVVLV LAVYQPSLHN GFIWDDDAYV HQNLTLTSLD GLKRIWLSRS ATPQYYPMVF SSFWVEYQIF GPEPMVFHLT NMMLHGINAI LVWLILMRLG LPWAWLAAAV FALHPVNVES VAWISERKNV LSGLFALSSL LLLVRLYLTD TDKTVQTPPS PKKNAYALYG ASFFLFILAL LSKSVTCMMP VVFLVLVWWK RGKTPLGTIG ATVPFFIAGI VAGINTSLVE KLHVGAQGAD WEFNLLERML IAGRALWFYA YKLIWPSEIM FTYPRWEIDS TAGWQYLFPA AVILLFAILF AAKNRVGRGP VAGVAIFAVT LFPALGFISY FPMLFSFVAD HFQYLATIAL ITLVIQGLHR MTSTGRRRAR TLAMGFCTLT LLALGVRTWQ EQDKYKNLQA LWEDTIRKNP DCYLALNNLG CVLMSQPDKL GQAYDIFAAT LKMGLDYPET RFNLARTLFY KGDHEAAIRY YTDLLENSPE ISPKLLVDVH YDMAMILVHR DQIEAAETHL RAALELKPFF PEGYNDLGVL LRRAEQFDKA IDAFSEALAM KPDYAEARYN LAQVLYDSGQ AEKAMVHYNF LVHDETTGPD LLASIHNDLG VIYTRRHELE VAKDHLSRAL CLAPDSPMFH NNLGLLMIKA GRKSTALESF SRAVVLDPDY AEARYQLALL SAETGDTAAA VSHYNYILSN APAADNVNLL ADVYNGMGNI YAEQNQTKKA IYYFKHALAL KPDFAEVHNN MALTLLSLNR RQKAIDHFTR ALAIQPGFTE AANSLVLTYS AAGAYDKALT VLKNLLAAAP DSAASISYNI ACIHSIRGDL ENAATWLNRA IDRGLRLQRL LETDPDLENI RKSQYYPGLL ERMQKSTEK
|
| |