Gene Dole_2417 details

Gene Information

Locus tag: Dole_2417
Symbol:
ID: 5695265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2913887
End bp: 2916175
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 58%
IMG OID: 641265023
Product: TPR repeat-containing protein
Protein accession: YP_001530298
Protein GI: 158522428
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
        [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
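
The coordinate and length fields above are mutually consistent, and a short sketch makes the arithmetic explicit. It assumes inclusive, 1-based start and end coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2913887, 2916175)
print(length, protein_length(length))  # 2289 762
```

Both values agree with the Gene Length and Protein Length fields of this record.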


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGATA CCCATCCCTC CCTGCCGCCG AGGCCCTGGT GGCCATTACT CCTTCTTGTC 
ATCATCGCGG CTGCCGCCAT TCTGACCTAT GCCAACTCTC TGACCGCGCC TTTCGTGTTT
GATGATGCCC ATAACATCGT TGAGAATCCT CATGTTCGCA TGGCCGAGCT TTCCCTCGAC
GGCATCATCG ACGCGGTCAC TGTCGGAACC CGGCGGCCGG TGGCCAGCCT CACCTTTGCA
CTCAACTACT ACTTTCACGG TTATGACATA ACCGGCTACC ATCTGGTCAA CATCGCCATC
CACATTCTGA CAGGCTTTCT GATGTTCCTG GTGGTCCGCC ACACACTGGG CCTGATGAAC
CTTGAAAGAC GGGACATGGT TGCCGGGCTT GCAGCCCTTG TGTGGACCGT CCACCCGCTT
CATACCCAGT CGGTGACCTA TGTCGTCCAG CGCATGGCCG CCCTGGCCAC CCTCTTTTTT
CTTTTCGCCC TCTACCTTTA TATAAGGGGC CGGCTCCGGC AAAGGGAAGG CCGCCCCCTT
GGCTCCCTGT TCTTTGTTCT CTGTGGCCTG TCCGGCATTC TGGCCGTGCT CTCCAAGCAG
ATCGCCGCCA CCCTGCCCTT TTTTATCCTG GTCTACGAAT GGTATTTTTT CCAAGACCTG
GACAGGGCGT GGATTAAAAA ACAGGCAAAA TGGATCGGCA GCGCAGTGGT TGTCATTGGC
GCGCTTGCGG CAATATACCT GGGCGCCTCC CCGATAGAAA AAATCATGGC CATGTATAAA
ACCCAGGACT TTACCCTGGG CCAACGGCTT TTAACCGAGC CCCGGGTGAT TTTTCTTTAT
CTTTCCCTGA TTCTTTTCCC CTATCCCGGG CGGCTCAACC TGGATTATGA CTTTCCCGTG
TCTGTCTCGC TGATCGATCC GGTTACCACC CTGGTCTCCA TACTGGGTTT GATCGGCCTG
GTCGCGGCCG CTGTTGTCAC GGCAAAAAGA TACCGGCTGG TCTCCTTTGC CGTTGTCTGG
TTTCTGGGTA ACCTGGCCAT TGAGTCCTCC TTTTTAGGCC TTGCCCTGGT GTTTGAACAC
CGCACCTACC TGCCCTCTGT CCTTGTAATC GCGGCCCTGG CCTGGGCCGC CATTACATAT
ATTCGGCCCC GGCCCCTGGC CATTGGTTTT CTCTGCGCCG TAACCCTCCT CTGGGGATTC
TGGACATACC AGCGTAACGC CATCTGGGCC GACGAAGTAG CCCTGTGGCG GGATGTGACA
GAAAAATCCC CCACCCTGGC CCGGCCCTGG AGCAACCTGG GAATGGCCCT GCAGATAGCG
GGAAACAGCG AGGCGGCGCT TCAGGCCTTT CAAAAGGCTA TTGCCCTTGA TCCGAATCAC
ATGGAGGCCC ACAACAACAG CGGTTTTATA TTAAGGGAGC TGGGCCGTCC CAAAGAGGCC
ATAAAGTTCT TCCGCAGGGC ACTGGAGATC AACCCGGCCT ATGCCGACGC CCACTACAAT
CTCGGCCTGG CCTTTTTTGA CTTAAAAGAC ATGGCCCAGG CCCGCACGGC TTTTGAGCAG
ACCCTTCGGG TCAATCCGCT CTACAGCAAG GCCCACAACA ACCTGGGGGT CATCCTGATG
CAGGAAGGTG ATCACGAGGC TGCTGTCGCC GCCTATCAGC GGGCACTTAA AACCGACCCC
CGTTTTGCCC AGGCCTACAA CAACCTGGGC ATTATCGCCT ATCAGCAGGG CAACCCGGAT
CAGGCGGCCT CGTTCTTTAA AAAAGCCCTG ACCGCGGATC CGGCCTATGC CGGCGCGGCC
AACAACCTGG CCCGGGTGCG GCAAACCATT GAAAAACACG GCCCCGCCAT CACGGAGCTT
AAACAGATGT TGCACAAGAC CCCCAACGAC GTGGATCTCT CCTGTCGCCT GGCCCAGGTT
TACCAGGCTG CCGGCATGCG GTACGGAGCC ATCTCACAAT ACCAAAAGGC CCTGGCTCTG
CAACCCGGCC ATGGACCAAG TCTCAACGCC CTTGGCGTCC TGTACGCCGC CATGGGCCAG
CCGGCAAAGG CTGTTGAGTG TTTTCGAAAG CTGTCGGCCC TGATGCCGGG CAATGCCACG
ATCTATTACA ACCTGGCCTG CCTGTACGCC CGGCAGAACC AGGTGGAACC GGCTGTTGAA
AACCTGAAAA AAGCCCTTGA CGCCGGGTAT GACAACCGGG AACAGATCCG CGCCGACAAG
GACCTTGCGC CCATTCGCGA CACCGAATTT TACAAAACCC GTATCGATTC ACAGGAGCCG
GGCAAATGA
 
Protein sequence
MTDTHPSLPP RPWWPLLLLV IIAAAAILTY ANSLTAPFVF DDAHNIVENP HVRMAELSLD 
GIIDAVTVGT RRPVASLTFA LNYYFHGYDI TGYHLVNIAI HILTGFLMFL VVRHTLGLMN
LERRDMVAGL AALVWTVHPL HTQSVTYVVQ RMAALATLFF LFALYLYIRG RLRQREGRPL
GSLFFVLCGL SGILAVLSKQ IAATLPFFIL VYEWYFFQDL DRAWIKKQAK WIGSAVVVIG
ALAAIYLGAS PIEKIMAMYK TQDFTLGQRL LTEPRVIFLY LSLILFPYPG RLNLDYDFPV
SVSLIDPVTT LVSILGLIGL VAAAVVTAKR YRLVSFAVVW FLGNLAIESS FLGLALVFEH
RTYLPSVLVI AALAWAAITY IRPRPLAIGF LCAVTLLWGF WTYQRNAIWA DEVALWRDVT
EKSPTLARPW SNLGMALQIA GNSEAALQAF QKAIALDPNH MEAHNNSGFI LRELGRPKEA
IKFFRRALEI NPAYADAHYN LGLAFFDLKD MAQARTAFEQ TLRVNPLYSK AHNNLGVILM
QEGDHEAAVA AYQRALKTDP RFAQAYNNLG IIAYQQGNPD QAASFFKKAL TADPAYAGAA
NNLARVRQTI EKHGPAITEL KQMLHKTPND VDLSCRLAQV YQAAGMRYGA ISQYQKALAL
QPGHGPSLNA LGVLYAAMGQ PAKAVECFRK LSALMPGNAT IYYNLACLYA RQNQVEPAVE
NLKKALDAGY DNREQIRADK DLAPIRDTEF YKTRIDSQEP GK
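
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be cleaned, its GC content computed, and the nucleotide sequence translated. It assumes the amino-acid assignments of translation table 11, which match the standard code, and it does not handle alternative start codons or ambiguous bases. All function names are hypothetical helpers, not part of any existing tool.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGACCGATA CCCATCCCTC CCTGCCGCCG"))  # MTDTHPSLPP
```

Applied to the first thirty bases, the sketch reproduces the first ten residues of the listed protein sequence. Running `gc_content` on the full gene sequence would give a value to compare against the GC content field of this record.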