Gene Information |
Locus tag | Dole_2475 |
Symbol | |
ID | 5695325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 3002260 |
End bp | 3004158 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641265083 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_001530356 |
Protein GI | 158522486 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF; [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
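The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3002260
END_BP = 3004158


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1899, matches "Gene Length"
print(protein_length(length_bp))  # 632, matches "Protein Length"
```

The subtraction of one codon applies only because the record lists the gene as unspliced and not a pseudogene; a spliced or truncated feature would need different handling.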
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00625277 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCGAC ATCTTTTCAT CGCCGCTGCT TTAACCTTCC TGCTGACCGC CGGATGCCCG GCGCCGGTCA CCACCCTGCA AAAGGGCGGT CCGGCAAAAC CGGCCCGCAC CCCGTCGGCC AACAGTTACT ATCTTTTCAC CCAGGCCCAG ATTGAAAAAG AGAAAGGGAG CCTGGATTCG GCCACTGCCT GGATGACCAG GGCCGTGGCC GCGGACCCGG ACTCGGCCTA CCTGAAAAGA GAGCTGGCCA TTCTCTTTTT GATGAAAAAG GAGAACGACC GGGCACGGCA GACGGTTGAA CAGCTGCTGG CGGTCCATCC CGACGATGTG GACGGACAGA TCCTGCTGGC CGGTATCCTG CACCATCAGG GCGACCTCCA GGGCGCGGCC CGCCTTTACG AGCAGGCCCT TGAAAACGAC CCCGACCAGG AAGGCCTCTA TCTTGTGCTG GGCAACCTTT ACACGGAACA GGGGCAGATG GAATCGGCCG CTGGCGTTTA CGAAAAAATG ACCCGGCATT TTCCAGACCT ATGGGACGGC CATTTCTTCC TGGGCAACAC CCGCAAAGAG ATGGGCCTTG CAAAAGAAGC GGAGAAAAGC TACAAAACAG CCATCCGGCT CAACCCGGAA GCCTTAAGCC CGCGGTTTGC CCTCCTGGAC CTGTACGAAC GGCAGACGCC GCCCACCGGG CCCGTCACGG TAACGGTAAC GTCCGGGGAC ACCCTGTTTT CCCTCTGCCG CCAGGTCTAC GGCAACTGCT CCCGGCGCCT GCTGGACCGT ATTGCCGCAG CAAACCCTTC CATCACCGAC ATGGACCGGC TGAACGTGGG CCAGACCCTT CAAATGCCGC CACAGGAAGG CTCCGCGCAC AACCGGCGGC AGGTTATCGA CCTGTATACC GACCTGCTGC GTGATAACCC GGACAATTAC CGGGCCGCCT TTGGCCTGGC CCTTCACCAC CATGCAGCCG GAGACGTCGA CGCGGCCCTG AAGATACTAA AGCCCCTGGG GCCCAAAAGT GATGAAACCC CGGGCGTGCT CCAGCCCCTG TTTCAGTACT ATATCGACCG GGGGAAATAC CCGGAGGCCG AGATCCTGGT CAACGGCATG CTGACCGGCG CCCCGGACAG CTCGGCCTTG AACTATCTGA TGGGCATGGT GCAGGACCAG CGGGAAAATA AAGAAGCGGC TATTCGATTT CTCGCAAAGG TCCGCCCCAA AAGCCGGTTC TACGACAATG CCCGGTTTCA CATGGCCCTG CTCCACCAGA GCATGGGAAA TACCGGTCAG GCCATAGAGA TCCTTGCCGC CCGCATCACC GACGAACCCG ACGATGTCGA CCACCTGCTC CGGCTGGGGG TGCTCTACGA AGAGGAGGAA GAATACGGGA AGGCGGAAGA TCTGTTTGAA CGGGGCCTTG CCATAAATCC GGACCATGTG GAACTGCTCT TTCGCCTGGG CGTGGTCTAC GATAAAACCG ACCGCAAAGA GGCCCTGATC ACCCAGATGG AAAAAGTGAT CGAAAAAGAC CCGGACAACG CCGGTGCCCT GAACTACCTG GGATACACCT ACGCGGAAAA GGGGGAGAAC CTTGACCAGG CCCAGGCCCT CATTGAAAAG GCACTGGCCC TTCAACCCGA TGACGGCTAT ATCACCGACA GCCTGGGGTG GGTCTATTTC AAAAAGGGAA ATGTCGAGAA GGCCGTTTAC TACCTGGAGG CAGCGGTAAG CCTGGTGCCC GACGACCCGG TGCTGCTGGA GCACCTGGGG GACGCCTACC GGGAACAGGG AAACACGGAA 
AAGGCCCTGG AGATGTACCG GCGCAGCCTG GCCAACCAGG AAAAGGACAC AACGGGAATA AAGGCCAAGA TTGAGGCCCT GCAAAAAGAG CTGCCATGA
|
Protein sequence | MIRHLFIAAA LTFLLTAGCP APVTTLQKGG PAKPARTPSA NSYYLFTQAQ IEKEKGSLDS ATAWMTRAVA ADPDSAYLKR ELAILFLMKK ENDRARQTVE QLLAVHPDDV DGQILLAGIL HHQGDLQGAA RLYEQALEND PDQEGLYLVL GNLYTEQGQM ESAAGVYEKM TRHFPDLWDG HFFLGNTRKE MGLAKEAEKS YKTAIRLNPE ALSPRFALLD LYERQTPPTG PVTVTVTSGD TLFSLCRQVY GNCSRRLLDR IAAANPSITD MDRLNVGQTL QMPPQEGSAH NRRQVIDLYT DLLRDNPDNY RAAFGLALHH HAAGDVDAAL KILKPLGPKS DETPGVLQPL FQYYIDRGKY PEAEILVNGM LTGAPDSSAL NYLMGMVQDQ RENKEAAIRF LAKVRPKSRF YDNARFHMAL LHQSMGNTGQ AIEILAARIT DEPDDVDHLL RLGVLYEEEE EYGKAEDLFE RGLAINPDHV ELLFRLGVVY DKTDRKEALI TQMEKVIEKD PDNAGALNYL GYTYAEKGEN LDQAQALIEK ALALQPDDGY ITDSLGWVYF KKGNVEKAVY YLEAAVSLVP DDPVLLEHLG DAYREQGNTE KALEMYRRSL ANQEKDTTGI KAKIEALQKE LP
|
| |
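The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is illustrative only: it builds the codon-to-amino-acid mapping in the usual NCBI ordering (translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons), strips the 10-base grouping spaces, and translates the first 30 and last 9 bases of the gene sequence. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base groups and the final group of the gene sequence.
first_30 = "ATGATCCGAC ATCTTTTCAT CGCCGCTGCT"
last_9 = "CTGCCATGA"

print(translate(first_30))  # MIRHLFIAAA
print(translate(last_9))    # LP*
```

The two outputs agree with the start (`MIRHLFIAAA`) and end (`LP`) of the listed protein sequence, with `TGA` as the stop codon.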