Gene Dole_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1037 
Symbol 
ID5693872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1221777 
End bp1223237 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content62% 
IMG OID641263634 
Productfimbrial assembly family protein 
Protein accessionYP_001528924 
Protein GI158521054 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01709] general secretion pathway protein L 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.13729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGCA GGGTTCTGGG TATTGATATC GGCAACACCA CGGTCTGCGC GGTTCTGGTG 
GGCCCCGGCG AAAGCGGCCT TCGCATAGAG GCCCACGCCC ATGCGGCAAT GGGAAAGGAC
GTCTCCCCCT GGCAGGTGCT GCCCGGTGTG CTGGACGCCC TGGGTCACCG GGCCGACTGG
TCGGGTGCGG CCTGCGTGGC CACCTTCCCG GACCGGCAGG TCTCCTACAG GCGTATTCAT
ATGCCGTTTG CCGACCCCCG GAAGATCGAA AAGGTGATTG CCTATGAACT GGAGCCGCTG
CTGCCTTTTG CGCCGGCGGA CCTGTTTATC GATTTTGTGG TGCTTTTCCC GGCGGTTTCC
CAGGAAAACG AGGCGGGCGT CGAGGTGCTG GCCGCGGCGG TACACAAGGA AAAGGTGAAG
GAGTGGATTG ACGGGTTTGC CGGTGCCGGT CTGCAGCCGG AAATGATCGT GCCCAGAGGA
TGGGCCGCCG CCGCGGTGCT GTCGGCAGAT ATTGAGGAAG CCCTGCTGGT GGAGACTACC
GGTGGGTATG TGACAACGGC CGTGGTTGGC GGCGGGGATG CCCAATGGAT CCGGTTTTTT
TCAGCCGGCC CGCCGCCGGA TCAGGAGGCG GGCCGGTCTG CGCTGCGCGA CGGCCTGCGC
CAGACCCTGC TGGCCTATGA ATATGAAACA GGGGTTGCCG CGGGGCTGAA AAACGTCTTT
GTGACCGGTG CCGATTCCCG GGCAGGGGAG GCGGTAAAAA CGGTTGCCGA CTGCCTGGGA
ATTGATGCCC GGCCGTTTAA CATGGCGGAC GGCTCTTCCC TGGTGACGGG TGGCCGGCCC
GACCATACCT GGGAGCCGGG CCTTCTGGAC ATGGCGCTCT GCCCGGCGTT GCTGAAGAAA
AGCCGGAAGC CCTGTATTAA TTTTCGGAAA GGGGAGTTCT CCCTGGCGGC CAGGTGGTTG
CAATACCGGT ACCTGTTCGG TGGCACCGCC GCGCTGGCGG CACTGGTGCT GGTACTCGGC
CTGATGGGTT TTTTTTACGA CCTTTATTCT CTCAATCGGT CCATAGCGAC TGTGGATGCC
CGGCGGCAGG CGATTTTCCG GGAGTGCTTT CCCGAATCGG CGGCAGCACC GTCTGCCGAG
ATGATGCGCT ACGAGATCGA CAGGCTTCGT GAATCCAGTG GCCTGCCTGC CGGACTGGAC
CGGACTGTTT ACCAGGTGGA TATATTGAAC GACATCAGCC GCCTGGTGCC CCAGGAGACG
GATGTGGTGA TCACCCGCCT GGACGCGGGG CTCAATGATG TTTCCGTCAC CGGCAACACG
GATGCTTTTC AGTCCGTGGA CACCATGAAA AACAGGCTTG AGGAGAGTGA TCTGTTCGGG
ACCGTTGTCA TCAGTTCCGC CACCATGGAT AAAGCCGATA AACGGGTTCA CTTCAAGTTG
CAGGTGGAGC TGAAGTCATG A
 
Protein sequence
MDGRVLGIDI GNTTVCAVLV GPGESGLRIE AHAHAAMGKD VSPWQVLPGV LDALGHRADW 
SGAACVATFP DRQVSYRRIH MPFADPRKIE KVIAYELEPL LPFAPADLFI DFVVLFPAVS
QENEAGVEVL AAAVHKEKVK EWIDGFAGAG LQPEMIVPRG WAAAAVLSAD IEEALLVETT
GGYVTTAVVG GGDAQWIRFF SAGPPPDQEA GRSALRDGLR QTLLAYEYET GVAAGLKNVF
VTGADSRAGE AVKTVADCLG IDARPFNMAD GSSLVTGGRP DHTWEPGLLD MALCPALLKK
SRKPCINFRK GEFSLAARWL QYRYLFGGTA ALAALVLVLG LMGFFYDLYS LNRSIATVDA
RRQAIFRECF PESAAAPSAE MMRYEIDRLR ESSGLPAGLD RTVYQVDILN DISRLVPQET
DVVITRLDAG LNDVSVTGNT DAFQSVDTMK NRLEESDLFG TVVISSATMD KADKRVHFKL
QVELKS