Gene Dole_2475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2475 
Symbol 
ID5695325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3002260 
End bp3004158 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content60% 
IMG OID641265083 
Productpeptidoglycan-binding LysM 
Protein accessionYP_001530356 
Protein GI158522486 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00625277 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGAC ATCTTTTCAT CGCCGCTGCT TTAACCTTCC TGCTGACCGC CGGATGCCCG 
GCGCCGGTCA CCACCCTGCA AAAGGGCGGT CCGGCAAAAC CGGCCCGCAC CCCGTCGGCC
AACAGTTACT ATCTTTTCAC CCAGGCCCAG ATTGAAAAAG AGAAAGGGAG CCTGGATTCG
GCCACTGCCT GGATGACCAG GGCCGTGGCC GCGGACCCGG ACTCGGCCTA CCTGAAAAGA
GAGCTGGCCA TTCTCTTTTT GATGAAAAAG GAGAACGACC GGGCACGGCA GACGGTTGAA
CAGCTGCTGG CGGTCCATCC CGACGATGTG GACGGACAGA TCCTGCTGGC CGGTATCCTG
CACCATCAGG GCGACCTCCA GGGCGCGGCC CGCCTTTACG AGCAGGCCCT TGAAAACGAC
CCCGACCAGG AAGGCCTCTA TCTTGTGCTG GGCAACCTTT ACACGGAACA GGGGCAGATG
GAATCGGCCG CTGGCGTTTA CGAAAAAATG ACCCGGCATT TTCCAGACCT ATGGGACGGC
CATTTCTTCC TGGGCAACAC CCGCAAAGAG ATGGGCCTTG CAAAAGAAGC GGAGAAAAGC
TACAAAACAG CCATCCGGCT CAACCCGGAA GCCTTAAGCC CGCGGTTTGC CCTCCTGGAC
CTGTACGAAC GGCAGACGCC GCCCACCGGG CCCGTCACGG TAACGGTAAC GTCCGGGGAC
ACCCTGTTTT CCCTCTGCCG CCAGGTCTAC GGCAACTGCT CCCGGCGCCT GCTGGACCGT
ATTGCCGCAG CAAACCCTTC CATCACCGAC ATGGACCGGC TGAACGTGGG CCAGACCCTT
CAAATGCCGC CACAGGAAGG CTCCGCGCAC AACCGGCGGC AGGTTATCGA CCTGTATACC
GACCTGCTGC GTGATAACCC GGACAATTAC CGGGCCGCCT TTGGCCTGGC CCTTCACCAC
CATGCAGCCG GAGACGTCGA CGCGGCCCTG AAGATACTAA AGCCCCTGGG GCCCAAAAGT
GATGAAACCC CGGGCGTGCT CCAGCCCCTG TTTCAGTACT ATATCGACCG GGGGAAATAC
CCGGAGGCCG AGATCCTGGT CAACGGCATG CTGACCGGCG CCCCGGACAG CTCGGCCTTG
AACTATCTGA TGGGCATGGT GCAGGACCAG CGGGAAAATA AAGAAGCGGC TATTCGATTT
CTCGCAAAGG TCCGCCCCAA AAGCCGGTTC TACGACAATG CCCGGTTTCA CATGGCCCTG
CTCCACCAGA GCATGGGAAA TACCGGTCAG GCCATAGAGA TCCTTGCCGC CCGCATCACC
GACGAACCCG ACGATGTCGA CCACCTGCTC CGGCTGGGGG TGCTCTACGA AGAGGAGGAA
GAATACGGGA AGGCGGAAGA TCTGTTTGAA CGGGGCCTTG CCATAAATCC GGACCATGTG
GAACTGCTCT TTCGCCTGGG CGTGGTCTAC GATAAAACCG ACCGCAAAGA GGCCCTGATC
ACCCAGATGG AAAAAGTGAT CGAAAAAGAC CCGGACAACG CCGGTGCCCT GAACTACCTG
GGATACACCT ACGCGGAAAA GGGGGAGAAC CTTGACCAGG CCCAGGCCCT CATTGAAAAG
GCACTGGCCC TTCAACCCGA TGACGGCTAT ATCACCGACA GCCTGGGGTG GGTCTATTTC
AAAAAGGGAA ATGTCGAGAA GGCCGTTTAC TACCTGGAGG CAGCGGTAAG CCTGGTGCCC
GACGACCCGG TGCTGCTGGA GCACCTGGGG GACGCCTACC GGGAACAGGG AAACACGGAA
AAGGCCCTGG AGATGTACCG GCGCAGCCTG GCCAACCAGG AAAAGGACAC AACGGGAATA
AAGGCCAAGA TTGAGGCCCT GCAAAAAGAG CTGCCATGA
 
Protein sequence
MIRHLFIAAA LTFLLTAGCP APVTTLQKGG PAKPARTPSA NSYYLFTQAQ IEKEKGSLDS 
ATAWMTRAVA ADPDSAYLKR ELAILFLMKK ENDRARQTVE QLLAVHPDDV DGQILLAGIL
HHQGDLQGAA RLYEQALEND PDQEGLYLVL GNLYTEQGQM ESAAGVYEKM TRHFPDLWDG
HFFLGNTRKE MGLAKEAEKS YKTAIRLNPE ALSPRFALLD LYERQTPPTG PVTVTVTSGD
TLFSLCRQVY GNCSRRLLDR IAAANPSITD MDRLNVGQTL QMPPQEGSAH NRRQVIDLYT
DLLRDNPDNY RAAFGLALHH HAAGDVDAAL KILKPLGPKS DETPGVLQPL FQYYIDRGKY
PEAEILVNGM LTGAPDSSAL NYLMGMVQDQ RENKEAAIRF LAKVRPKSRF YDNARFHMAL
LHQSMGNTGQ AIEILAARIT DEPDDVDHLL RLGVLYEEEE EYGKAEDLFE RGLAINPDHV
ELLFRLGVVY DKTDRKEALI TQMEKVIEKD PDNAGALNYL GYTYAEKGEN LDQAQALIEK
ALALQPDDGY ITDSLGWVYF KKGNVEKAVY YLEAAVSLVP DDPVLLEHLG DAYREQGNTE
KALEMYRRSL ANQEKDTTGI KAKIEALQKE LP