Gene Dole_1287 details


Gene Information

Locus tag: Dole_1287
Symbol:
ID: 5694122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1538325
End bp: 1539806
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 52%
IMG OID: 641263881
Product: capsular polysaccharide biosynthesis protein-like protein
Protein accession: YP_001529170
Protein GI: 158521300
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
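
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that reading reproduces the reported 1482 bp) and that the gene length includes a three-base stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1538325
END_BP = 1539806
PROTEIN_LENGTH_AA = 493


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus a stop codon."""
    return 3 * protein_aa + (3 if has_stop else 0)


print(gene_length(START_BP, END_BP))           # 1482
print(expected_cds_length(PROTEIN_LENGTH_AA))  # 1482
```

Both figures agree with the listed gene length: 493 codons plus one stop codon is 1482 bases.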


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTTGT TTCGAGACAA CAGATCGATG CGGCACGATG CCTTACGTCA CCCGCTGCCT 
TCGAATATTG CCATCTGGTC ACCATCCGAT GATGATGGCG TTTTCCCGCT CGATGGCTTG
CGCCCCGCTA TCCGTCATAT AAACACGTCA TCTAAAAAAA AAGGCGACCC CCTTGCCAGG
CTGCGGGTGA CCTGGATGGG AAGGTTTTAC ACAAAGCATT TCAAGCGGTA CGCCTTTATT
CGATGGCTGG CGTCCTGGCT GTGGCGCAAC CTGTATCCGG TTTATCTCAA TTTTATTGCG
GGCCTTTCCA TTTATTTCCG TAACAGGAAG GCGATGAGAC GGCGGCGCCT GATAACCCTG
ACGGAATACG CGGCGGCGCA GCATCTGCAG AAAGTAAAAC TGGCCGATGC CGGCCTGGTA
AAAACACCGG CACCGGCAGT CTATCCTGCC GGTGACCGGG ACTGCCTGCA ATCACCTCAC
GAGGAGTACA TATTTCCGGA AGTGTTTGTC GCCAGCATTA AAAACGGTAT GGTCTATGGC
GGCACCAATC TTATATTGCT TGAAGACCAG GTGATTTGCC ACGACCTGTA CGATTTCAAA
CGGGACTACA CATCCGAGGA GATGCACGGC CGGACAGTGA TTGACCCCGC AGGCCGCCGT
ATTCGATGGA TACTCCATGA TGAGACGCCT GAACAGATTC CCATGGCCGC CGCCTTTGTC
GATGCCTGCG CGCCTAACTA TGCCCACTGG ATGACCGAGG TACTGCCCCG TATCGCCCTT
TTTTGTAATG AGCCCCGGTT TAATCGTATT CCCATTGTGG TGAACGACGG GCTTCATGAA
AACATAATGG AATCGCTGTT CTTCGTGGCC GGTCCGGAGC GTGAAATCAT CACCCTGCCG
ATCGGCAGGG CCTTGGCCAT CGACACCCTG TACCTAACAT CAGTGGCCGG TTATGTGCCT
TTTGAACGTC GCACCACCAA GCTTTCGGGA CACTCTCACG GCAGGTTCAG CCCACGGGCG
TTTGAACTGC TACGCGAGCG CATGGCAGTG CTTGACCCAA AGACCGGGAA CCGGGACTGG
CCGGAAAAAA TTGTTTTGCA CCGGAATTCG GGTTACAGAA AGGTCGTCAA TATAGATGAA
ATCGAAAGTG AACTGGTCGG CCGGGGTTAT GCTGTTGTTC AGCCTGAAAA GCTGACCTTT
TTACAGCAGG TCCACCTGTT CAGCCATGTA AAGCATATCG TGGGATCATC CGGTTCGGCC
CTGGCAAACA TGATGTTTGC TCCAAAAGAT GCGAAAATAA TAATCCTTTT GAATAAACAT
CCGGATACCA GTTACTGGTA TTGGCAGAAC ATGGCCTGCA CCTGCGGTAA CAGGATTCAC
TACGTGCTGG GAAAGTCGCG TGACATCGGT AACAACGGCA TCCATGCTGA TTTTGAAATC
CGCATGGATC ATCTGATTGC GTCGATAGAG GGAGAATCAT GA
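
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any analysis. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes the GC fraction, and runs a rough completeness check. The helper names are hypothetical, and the check assumes an ATG start codon, although translation table 11 also permits alternative starts.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (an assumption), stop at the end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First line of the gene sequence above, pasted as printed.
first_line = "ATGCCTTTGT TTCGAGACAA CAGATCGATG CGGCACGATG CCTTACGTCA CCCGCTGCCT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_content(seq):.0%}")    # GC fraction of this first line only
```

Pasting the full sequence into `clean_sequence` and passing the result to `gc_content` gives a value that can be compared with the GC content listed under Gene Information.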
 
Protein sequence
MPLFRDNRSM RHDALRHPLP SNIAIWSPSD DDGVFPLDGL RPAIRHINTS SKKKGDPLAR 
LRVTWMGRFY TKHFKRYAFI RWLASWLWRN LYPVYLNFIA GLSIYFRNRK AMRRRRLITL
TEYAAAQHLQ KVKLADAGLV KTPAPAVYPA GDRDCLQSPH EEYIFPEVFV ASIKNGMVYG
GTNLILLEDQ VICHDLYDFK RDYTSEEMHG RTVIDPAGRR IRWILHDETP EQIPMAAAFV
DACAPNYAHW MTEVLPRIAL FCNEPRFNRI PIVVNDGLHE NIMESLFFVA GPEREIITLP
IGRALAIDTL YLTSVAGYVP FERRTTKLSG HSHGRFSPRA FELLRERMAV LDPKTGNRDW
PEKIVLHRNS GYRKVVNIDE IESELVGRGY AVVQPEKLTF LQQVHLFSHV KHIVGSSGSA
LANMMFAPKD AKIIILLNKH PDTSYWYWQN MACTCGNRIH YVLGKSRDIG NNGIHADFEI
RMDHLIASIE GES
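
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids as the standard table and differs only in which codons may act as start codons, which this sketch ignores. The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

# Amino acids in NCBI order, with bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted sequences can be pasted as-is.
    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCCTTTGT TTCGAGACAA CAGATCGATG"))  # MPLFRDNRSM
print(translate("GGAGAATCAT GA"))                     # GES
```

The two fragments match the first ten and last three residues of the protein sequence listed above.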