Gene Dole_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1899 
Symbol 
ID5694739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2302880 
End bp2304151 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content53% 
IMG OID641264497 
ProductYD repeat-containing protein 
Protein accessionYP_001529780 
Protein GI158521910 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.126482 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTGATCC CCAACCTGGG CTTTGTCACC ATCAGCGACT ACACATGGAA CCGGCCGGCT 
GCCATTACCC TGCCAGGCGG GGCCACCCGT GAGTTTGAAT ACGATCCCCT GATGCGGGTA
AAAGAGATCA CCTCTCTTGA CCCGGGCGGC AATGCCCTGT TAAATTACAC CTACGCCCAT
GACGCCATGG ACAACATCAC GGCCAAGCAG ACCGAGCACG GGGATTATGG GTATGGATAT
GACGATCTGC ACCGGCTGGC TACGGTTGAC AACCCGGCCG CGGGCCTGGC CGACGAGGCC
TTTACCTATG ACAGCGTGGG CAACCGCCTG ACCTCGGCCC AGGCGGCAGG AGACTGGACA
TACAATGACA ACAACGAATT GTTGTCATCC GTTGGAGTGA CCGGGGGATC CACATACGAG
TACGACGCCA ACGGCAATAC CATTAAAAAG ACAGTGGGCG GCGTTGTCAC CAGTTATGTA
TACAACACGG AAGACCGGCT GACCCAGGTC TGGAGCGGCC TGCCCGGTTC CGGTTCTTTG
ACAGCCACGT ACTATTATGA CCCGTTTGGC CGCAGGCTGT GGAAGGGGGT CGGGGGAACA
CGGACGTACT TCCATTACAG TGATGAGGGC CTCGTCGCGG AAATCAATGC CTCCGGAACC
GTGGTCAAGT CCTACGGCTG GCAGCCCGGC GGCACCTGGG GCACCGATCC GCTGTTCATG
AAGGTTAGTG GGAATTATTA CTTCTACCAC AATGACCACC TCGGTACCCC TCAGAAGCTG
ACGGCCAGTA ACGGGGCGGT GGTTTGGAGT GCTAAGTACG AGAGCTTTGG GGATGCGACT
GTTGAGATCG AGACGGTTGA GAATAACCTC AGGTTCCCGG GCCAATACTT TGATGGGGAG
AGTGGGCTGC ATTATAACCT GCATCGTTAT TATGCTCCTG AGCTAGGACG GTTCTTGAAA
GATGATCCAA TCGGACTTCG GGGTGGGATT AATCAATATA TTTATGCAGA TAACAATGTG
AGTAATAATA CTGATCCTTA CGGATTGTTT TCAAAAAAGA CTAAATGCCA GATAGCTTGT
AATGTGGCAT TAGGCTATAC TTGTACTGTT TTAGGTATTG GATCAGGCAT AGTCTCCGGG
CCATTAGTTG GAATTGGTGT TGGGGTTGTA TGCAGGGTAG TTTCATTTGG TATATGCTAT
GCAACTTGTT CAGGGGCACC AGATGATTGC TCAGACTTTC CACCGGGAGA CTATTCTTTT
TCTTATGCCT AA
 
Protein sequence
MLIPNLGFVT ISDYTWNRPA AITLPGGATR EFEYDPLMRV KEITSLDPGG NALLNYTYAH 
DAMDNITAKQ TEHGDYGYGY DDLHRLATVD NPAAGLADEA FTYDSVGNRL TSAQAAGDWT
YNDNNELLSS VGVTGGSTYE YDANGNTIKK TVGGVVTSYV YNTEDRLTQV WSGLPGSGSL
TATYYYDPFG RRLWKGVGGT RTYFHYSDEG LVAEINASGT VVKSYGWQPG GTWGTDPLFM
KVSGNYYFYH NDHLGTPQKL TASNGAVVWS AKYESFGDAT VEIETVENNL RFPGQYFDGE
SGLHYNLHRY YAPELGRFLK DDPIGLRGGI NQYIYADNNV SNNTDPYGLF SKKTKCQIAC
NVALGYTCTV LGIGSGIVSG PLVGIGVGVV CRVVSFGICY ATCSGAPDDC SDFPPGDYSF
SYA